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NOVEL HUMAN POLYNUCLEOTIDES AND THE 
POLYPEPTIDES ENCODED THEREBY 



This application claims priority to United States Provisional Application No. 
60/104,292, filed October 14, 1998, which is also incorporated herein by reference for any 
purpose. 

1. FIELD OF THE INVENTION 

The present invention is in the field of molecular genetics. The application discloses 
novel nucleic acid sequences that partially define the scope of human exons that can be 
trapped and identified by the disclosed vectors/methods, and which are useful, inter alia, for 
identifying the organization of the coding regions and of the human genome. 

2. BACKGROUND OF THE INVENTION 

The Human Genome Project and privately financed ventures are currently sequencing 
the human genome, and the substantial completion of this milestone is expected before the 
year 2003. The hope is that, at the conclusion of the sequencing phase, a comprehensive 
representation of the human genome will be available for biomedical analysis. However, the 
data resulting from such efforts will largely comprise human genomic sequence of which 
only a fraction actually encodes expressed sequence information. Although sophisticated 
computer-assisted exon identification programs can be applied to such genomic sequence 
data, the computer predictions require verification by laboratory analysis to actually identify 
the coding regions of the genome. Consequently, the availability of cDNA information will 
significantly contribute to the value of the human genomic sequence since cDNA sequence 
provides a direct indication of the presence of transcribed sequences as well as the location of 
splice junctions. Thus, the sequencing of cDNA libraries to obtain expressed sequence tags 
(or ESTs) that identify exons expressed within a given tissue, cell, or cell line is currently in 
progress. As a consequence of these efforts, a large number of EST sequences are presently 
compiled in public and privately held databases. However, the present EST paradigm is 
inherently limited by the levels and extent of mRNA production within a given cell. A 
related problem is the lack of cDNA sources from specific tissue and developmental 
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expression profiles. In addition, some genes are typically only active under certain 
physiological conditions or are generally expressed at levels below or near the threshold 
necessary for cDNA cloning and detection and are therefore not effectively represented in 
current cDNA libraries. 
5 Researchers have partially addressed these issues by using phage vectors to clone 

genomic sequences such that internal exons are trapped (Nehls, et al, 1994, Current Biology, 
4(%>:983-989, and Nehls, et al, 1994, Oncogene, P:2169-2175). However, such libraries 
require the random cloning of genomic DNA into a suitable cloning vector in vitro, followed 
by reintroduction of the cloned DNA in vivo in order to express and splice the cloned genes 
1 0 prior to producing the cDNA library. Additionally, such methods can only "trap" the internal 
exons of genes. Consequently, genes containing a single exon or a single intron are typically 
not trapped by traditional methods of exon trapping. 



3. SUMMARY OF THE INVENTION 

1 5 The subject invention provides numerous isolated and purified novel human cDNAs 

produced using gene trap technology. The novel human gene trapped sequences (GTSs) of 
the subject invention are disclosed as SEQ ID NOS:9-503 in the appended Sequence Listing. 

The subject invention further contemplates the use of one or more of the subject 
GTSs, or portions thereof, to isolate cDNAs, genomic clones, or full-length 

'20 genes/polynucleotides, or homologs, heterologs, paralogs, or orthologs thereof, that are 
capable of hybridizing to one or more of the disclosed GTSs or their complementary 

iJ sequences under stringent conditions. 

The subject invention additionally contemplates methods of analyzing biopolymer 
{e.g., oligonucleotides, polynucleotides, oligopeptides, peptides, polypeptides, proteins, etc.) 

25 sequence information comprising the steps of loading a first biopolymer sequence into or 
onto an electronic data storage medium {e.g., digital or analogue versions of electronic, 
magnetic, or optical memory, and the like) and comparing said first sequence to at least a 
portion of one of the polynucleotide sequences, or amino acid sequence encoded thereby, that 
is first disclosed in, or otherwise unique to, SEQ ID NOS:9-503. Typically, the 

30 polynucleotide sequences, or amino acid sequences encoded thereby, will also be present on, 
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or loaded into or onto a form of electronic data storage medium, or transferred therefrom, 
concurrent with or prior to comparison with the first polynucleotide. 

Another embodiment of the invention is the use of a oligonucleotide or polynucleotide 
sequence first disclosed in at least a portion of at least one of the GTS sequences of SEQ ID 
NOS:9-503 as a hybridization probe. Of particular interest is the use of such sequences in 
conjunction with a solid support matrix/substrate (resins, beads, membranes, plastics, 
polymers, metal or metallized substrates, crystalline or polycrystalline substrates, etc.). Of 
particular note are spatially addressable arrays (i.e., gene chips, microtiter plates, etc.) of 
polynucleotides wherein at least one of the polynucleotides on the spatially addressable array 
comprises an oligonucleotide or polynucleotide sequence first disclosed in at least one of the 
GTS sequences of SEQ ID NOS:9-503. 

Similarly, one or more oligonucleotide probes based on, or otherwise incorporating, 
sequences first disclosed in any one of SEQ ID NOS:9-503, can be used in methods of 
obtaining novel gene sequence via the polymerase chain reaction or by cycle sequencing. 
Similar oligonucleotide hybridization probes can also comprise sequence that is 
complementary to a portion of a sequence that is first disclosed in, or preferably unique to, at 
least one of the GTS polynucleotides in the sequence listing. The oligonucleotide probes will 
generally comprise between about 8 nucleotides and about 80 nucleotides, preferably between 
about 15 and about 40 nucleotides, and more preferably between about 20 and about 35 
nucleotides. 

Moreover, an oligonucleotide or polynucleotide sequence first disclosed in at least 
one of the GTS sequences of SEQ ID NOS:9-503 can be incorporated into a phage display 
system that can be used to screen for proteins, or other ligands, that are capable of binding an 
amino acid sequence encoded by an oligonucleotide or polynucleotide sequence first 
disclosed in at least one of the GTS sequences of SEQ ID NOS:9-503. 

An additional embodiment of the present invention is a library comprising 
individually isolated linear DNA molecules corresponding to at least a portion of the 
described human GTSs which are useful for synthesizing physically contiguous sequences of 
overlapping GTSs by, for example, the polymerase chain reaction (PCR). 
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The subject invention also provides for an antisense molecule which comprises at 
least a portion of sequence that is first disclosed in, or preferably unique to, at least one of the 
GTS polynucleotides. 

The subject invention also contemplates a purified polypeptide in which at least a 
5 portion of the polypeptide is encoded by, and thus first disclosed by, at least a portion of a 
GTS of the present invention. The invention also relates to naturally occurring 
polynucleotides comprising the disclosed GTSs that are expressed by promoter elements 
other than the promoter elements that normally express the GTSs in human cells (i.e., gene 
activated GTSs). Such promoter elements can be directly incorporated into the cellular 

10 genome or recombinantly engineered upstream from at least a portion of a GTS (preferably at 
least about 50, more preferably at least about 75, and most preferably at least about 100 to 
130 base in length) of the present invention, or a complement thereof. A particularly 
preferred embodiment includes recombinantly engineered expression vectors that similarly 
have or incorporate at least a, preferably unique, portion of the disclosed GTSs or 

15 complement thereof. 

4. DESCRIPTION OF THE SEQUENCE LISTING AND FIGURES 

The Sequence Listing is a compilation of nucleotide sequences obtained by 
sequencing a human gene trap library that at least partially identifies the genes in the target 

20 cell genome that can be trapped by the described gene trap vectors (i.e. , the repertoire of 
genes that are active or have not been inactivated). 

Figures 1 A- ID. Figure 1 A illustrates a retroviral vector that can be used to practice 
the described invention. Figure IB shows a schematic of how a typical cellular genomic 
locus is effected by the integration of the retroviral construct into intronic sequences of the 

25 cellular gene. Figure 1C shows the chimeric transcripts produced by the gene trap event as 
well as the locations of the binding sites for PCR primers. Figure ID shows how the PCR 
amplified cDNAs are directionally cloned into a suitable GTS vector. 
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5. DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed to novel human polynucleotide sequences obtained 
from cDNA libraries generated by the normalized expression of genomic exons using gene 
trap technology. In particular, the disclosed novel polynucleotides were generated using a 
5 modified reverse-orientation retroviral gene trap vector that was nonspecifically integrated 
into the target cell genome, although other polynucleotide (DNA or RNA) gene trap vectors 
could have been introduced to the target cells by, for example, transfection, electroporation, 
or retrotransposition. Preferred retroviral vectors that can be used to practice the present 
invention (as well as methods and recombinant tools for making and using the described 

1 0 GTSs) are disclosed in, inter alia, U.S. Application Ser. No. 09/276,533 , filed March 25, 
1999 which is herein incorporated by reference in its entirety. 

After integration, the exogenous promoter of the sequence acquisition, or 3' gene trap, 
component of the vector was used to express and splice a chimeric mRNA that was 
subsequently reverse transcribed, amplified, and subject to DNA sequence analysis. Unlike 

1 5 conventional cDNA libraries, the presently disclosed libraries are largely unaffected by the 
bias inherent in cDNA libraries that rely solely on endogenous mRNA expression. 
Additionally, by integrating a vector into the target cell genes, a chimeric mRNA is produced 
that allows for the specific expansion and isolation of cDNAs corresponding to the chimeric 
mRNAs using vector specific primers. 

20 As used herein the term "gene trapped sequence", or "GTS", refers to nucleotide 

sequences that correspond to naturally occurring endogenously encoded human exons that 
have been expressed as part of a chimeric "gene trapped" mRNA. Typically, the chimeric 
mRNA incorporates at least a portion of sequence that has been engineered into the sequence 
acquisition exon of a gene trap vector which, inter alia, facilitates cDNA production by 

25 reverse transcriptase and amplification of the cDNA by PCR to produce an isolated linear 
DNA molecule. The disclosed GTSs do not include vector encoded sequences. 



The term "GTS" not only refers to polynucleotides that are exactly complementary to 
naturally occurring human mRNA, but also refers to "GTS derivatives". The term "GTS 
30 derivative" also refers to heterologs, paralogs, orthologs, and allelic variants of the specific 
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GTSs described herein. In addition, a GTS may include the complete coding region for a 
naturally occurring peptide or polypeptide. A GTS may also include a complete open reading 
frame. 

The term "GTS peptide" as used herein includes oligopeptides or polypeptides sharing 
5 biological activity and/or immunogenicity (or immunological cross-reactivity) with an amino 
acid sequence encoded by at least one of the disclosed GTSs or complement thereof. The 
terms "biological activity" (or "biological characteristics") of a polypeptide refers to the 
structural or biochemical function of the polypeptide in the normal biological processes of the 
organism in which the polypeptide naturally occurs. Examples of such characteristics include 

1 0 protein structure and/or conformation, which can be determined biochemically by reaction 
with appropriate ligands or receptors or by suitable biological assays. 

A GTS peptide may also correspond to a full-length naturally occurring peptide or 
polypeptide. GTS peptides can have amino acid sequences that directly correspond to 
naturally occurring polypeptides or amino acid sequences or can comprise minor variations. 

15 Such variations can include amino acid substitutions that are the result of the replacement of 
one amino acid with another amino acid having a similar structural and/or chemical 
properties, such as the substitution of a leucine with an isoleucine or valine, an aspartate with 
a.glutamate, or a threonine with a serine, i.e., conservative amino acid replacements. 
Additional variations include minor amino acid deletions and/or insertions, typically in the 

20 range of about 1 to 6 amino acids, and can also include one or more amino acid substitutions. 
Guidance in determining which GTS peptide amino acid residues can be replaced or deleted 
without abolishing the biological activity of interest may be determined empirically, or by 
using computer amino acid sequence databases to identify polypeptides that are homologous 
to a given GTS peptide and trying to avoid amino acid substitutions in conserved regions of 

25 homology. 

"Homology" refers to the similarity or the degree of similarity between a reference, or 
known polynucleotide and/or polypeptide and a test nucleotide sequence and/or its 
corresponding amino acid sequence. As used herein, "homology" is defined by sequence 
similarity between a reference sequence and at least a portion of the newly sequenced 
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nucleotide. Typically, a corresponding amino acid sequence similarity should exist between 
the peptides encoded by such homologous sequences. 

To determine whether proteins are homologous, the GTS sequence is translated into 
the corresponding amino acid sequence. The amino acid sequence is then compared with 
5 reference polypeptide sequences. A short string of matching amino acid sequence can 
constitute good evidence of homology (for example, repeating Gly-Pro-X sequence, or the 
presence of an RGD motif). However, typically a larger number of similar amino acids is 
required to label two sequences homologous. Generally, the match needs to be at least about 
7 or 8 amino acids, among which perhaps one mismatch is allowed. These criteria allow 
10 good sensitivity in finding all relevant sequences while providing a threshold amount of 
selectivity. 

After peptide homology has been found, the respective nucleotide sequences are 
compared. An alignment of the reference and new sequences should show at least about 
60%, and preferably at least about 65%, agreement over the minimum of 21 nucleotides 

1 5 which correspond to the 6 matching amino acids. Generally, a low percentage of agreement 
is acceptable if the differences are in the "wobble" position (or third nucleotide of the triplet 
coding for an amino acid). 

As used herein, a "mutated" polypeptide has an altered primary structure typically 
resulting from corresponding mutations in the nucleotide sequence encoding the protein or 

20 polypeptide. As such, the term "mutated" polypeptides can include allelic variants. 

Mutational changes in the primary structure of a polypeptide result from deletions, additions 
or substitutions. A "deletion" is defined as a change in a polypeptide sequence in which one 
or more internal amino acid residues are absent. An "addition" is defined as a change in a 
polypeptide sequence which has resulted in one or more additional internal amino acid 

25 residues as compared to the wild type. A "substitution" results from the replacement of one 
or more amino acid residues by other residues. A polypeptide "fragment" is a polypeptide 
consisting of a primary amino acid sequence which is identical to a portion of the primary 
sequence of the polypeptide to which the polypeptide is related. 

A host cell "expresses" a gene or DNA when the gene or DNA is transcribed into 

30 RNA that may optionally be translated to produce a polypeptide. 
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The subject invention also includes GTSs which are incorporated into expression 
vectors and transformed into host cells which subsequently express the polynucleotides 
and/or polypeptides encoded by the GTSs. 

The subject invention also includes antibodies capable of specifically binding to GTS 
peptides, as well as methods of detecting a GTS peptides or the corresponding protein by 
combining a sample for analysis with an antibody capable of specifically binding to a GTS 
peptide and detecting the formation of antibody complexes present in the sample. 

The subject invention also includes a method of isolating a GTS peptide, or its 
corresponding protein comprising the step of separating the GTS peptide, or its corresponding 
protein, from a solution utilizing an antibody capable of specifically binding to the GTS 
peptide or its corresponding protein. 

The subject invention also provides for markers for use in detecting diseases, 
biological events, cell types and tissues which comprise at least a portion of a GTS sequence. 

Further, the subject invention provides polynucleotide markers useful for physical and 
genetic mapping of the human, and/or certain model organism, genome(s). In particular, the 
nucleotide sequences in the Sequence Listing provide sequence tagged sites (STS), that will 
be useful in completing an STS-based physical map of the human genome, a goal of the 
human genome project (Collins, F. and Galas, D. (1993) Science 262:43-46). Additionally, 
some of these sequences will identify new genes. These new genes will be useful in 
completing physical and genetic maps of all the genes in the human genome, another goal of 
the human genome project. 

The exons contained in the disclosed GTSs contain open reading frames (present in 
one of the three reading frames in either orientation of the sequence). Typically, the gene trap 
strategy employed to generate the GTS sequences allows for the directional cloning and 
identification of the sense strand. However, it is possible that occasional sequencing errors or 
random reverse transcription, or PCR aberrations will mask the presence of the appropriate 
open reading frame. In such cases of sequencing error, it is possible to determine the 
corresponding GTS sequence by expressing the GTS in an appropriate expression system and 
determining the amino acid sequence by standard peptide mapping and sequencing 
techniques ( Current Protocols in Molecular Biology . John Wiley & Sons, Vol. 2, Sec 16, 
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1989). Additionally, the actual reading frame and amino acid sequence of a given nucleotide 
sequence may be determined by in vitro synthesis of a portion of an oligopeptide comprising 
a possible amino acid sequence and preparing antibodies to the oligopeptide. If the 
antibodies react with cells from which the GTS of interest was derived, the reading frame is 
likely correct. Alternatively, codon usage analysis can be used to track and correct reading 
frame shifts in gene sequence data. 

The correct amino acid sequence of a GTS protein is largely a function of the DNA 
sequence and the correct amino acid sequence can be readily determined using routine 
techniques. For example, by providing independent three fold sequencing coverage of the 
GTS library, random sequencing/RT/PCR errors can be identified and corrected by selecting 
the sequence represented by the majority of gene trap sequences covering a given nucleotide. 

The nucleotide sequences of the Sequence Listing may contain some sequencing 
errors and several of the nucleotide sequences of the Sequence Listing may contain 
nucleotides that have not been precisely identified, typically designated by an N, rather than 
A, T, C, or G. Since each of the nucleotide sequences presented in the Sequence Listing is 
believed to uniquely identify a novel GTS, any sequencing errors or N's in the nucleotide 
sequences of the Sequence Listing do not present a problem in practicing the subject 
invention. Several methods employing standard recombinant methodology, for example, as 
described in Molecular Cloning: Laboratory Manual 2nd ed„ Sambrook et al. (1989), Cold 
Spring Harbor Laboratory, Cold Spring Harbor, NY (or periodic updates thereof), may be 
used to correct errors and complete the missing sequence information. For example, a 
nucleotide and/or oligonucleotide corresponding to a portion of a nucleotide sequence of GTS 
of interest, can be chemically or biochemically synthesized in vitro, and used as a 
hybridization probe to screen a cDNA library in order to identify and obtain library isolates 
comprising recombinant DNA sequences containing the GTS cDNA sequence of interest. 
The library isolate may then be independently subjected to nucleotide sequencing using one 
or more standard sequencing procedures so as to obtain a complete and accurate nucleotide 
sequence. 

For the purposes of this disclosure, the term "isolated and purified polynucleotide" 
comprises a polynucleotide purified from a natural cell or tissue as well as polynucleotides 
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which are complementary to the polynucleotides isolated from the natural cell or tissue. One 
example of an isolated or purified polynucleotide, or a substantially isolated preparation 
thereof, is a preparation where the polynucleotide of interest represents at least about 80 
percent, preferably at least about 85 percent, and more preferably at least about 90 to 95 
percent or more of the net product(s) that can be visualized on a DNA agarose gel stained 
with ethidium bromide. 

The described GTSs were obtained from isolates of a cDNA library. Clones isolated 
from cDNA libraries generated by 3' gene trapping typically contain only a portion of the 
mature RNA transcript that has been spliced to a vector encoded sequence acquisition exon, 
and therefore such clones may only encode a portion of the polypeptide of interest (however, 
it should be appreciated that a number of the disclosed GTSs may encode full- length ORFs). 
To obtain the remainder of the sequence, the GTSs can be used as hybridization probes to re- 
screen the same or a different cDNA library, and additional clones isolated by the re- 
screening can be purified and characterized using standard methods (Benton and Davis, 1977, 
Science, 196: 180-1 83). Once sufficiently purified, the size of the DNA insert can be 
approximated by agarose gel electrophoresis and the larger clones can be analyzed to 
determine the exact number of bases by DNA sequencing. Frequently, the use of a library 
different from the one which contained the original clone is useful for this purpose, and 
particularly a library that has been prepared with extra care to extend cDNA synthesis to full- 
length, or a library that has been intentionally primed with random primers in order to "jump 
over" particularly difficult regions of the transcript sequence. 

Missing upstream DNA sequence can also be obtained by "primer extension" of the 
cDNA isolate, a practice common in the art (Sambrook et al. (1989), Molecular Cloning: 
Laboratory Manual 2nd ed. pg 7.79-7.83, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY), whereby a sequence-specific oligonucleotide is used to prime reverse- 
transcription near the 5'-end of the cDNA clone and the resulting product is either cloned into 
a bacterial vector or is analyzed directly by DNA sequencing. Finally, newer methods to 
extend clones in either direction employ oligonucleotide-directed thermocyclic DNA 
amplification of the missing sequences, wherein a combination of a cDNA-specific primer 
and a degenerate, vector-specific, or oligo-dT-binding second oligonucleotide can be used to 
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prime strand synthesis. In any of the above methods or other methods of detecting additional 
cDNA sequence, two or more resulting clones containing the partial cDNA sequence can be 
recombined to form a single full-length cDNA by standard cloning methods. The resulting 
full-length cDNA may subsequently be transferred into any of a number of appropriate 
expression vectors. 

In many instances, the sequencing of clones resulting from independent nonspecific 
gene trap events will result in a natural redundancy of sequencing more than one cDNA from 
a particular gene. As discussed above, this feature is a built in form of error detection and 
correction. These independent gene trap events can also be combined using the various 
overlapping regions of sequence into an entire contiguous sequence ("contig") containing the 
complete nucleotide sequence of the full length cDNA. Similar methodology can be used to 
combine one or more GTSs with one or more publicly available, or proprietary, ESTs to 
synthesize, electronically or chemically, a contiguous sequence. 

The ABI Assembler application, part of the INHERITS DNA analysis system 
(Applied Biosystems, Inc., Foster City, CA), creates and manages sequence assembly 
projects by assembling data from selected sequence fragments into a larger sequence. The 
Assembler combines two advanced computer technologies which maximize the ability to 
assemble sequenced DNA fragments into Assemblages, a special grouping of data where the 
relationships between sequences are shown by graphic overlap, alignment and statistical 
views. The process is based on the Meyers-Kececioglu model of fragment assembly 
(INHERITS™ Assembler User's Manual, Applied Biosystems, Inc., Foster City, CA), and 
uses graph theory as the foundation of a very rigorous multiple sequence alignment program 
for assembling DNA sequence fragments. Additional methods of using GTSs and obtaining 
full length versions thereof are discussed in U.S. Patent No. 5,817,479, herein incorporated 
by reference. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of 
the genetic code (see, for example, Table 4-1 at page 109 of "Molecular Cell Biology", 1986, 
J. Darnell et al. eds., Scientific American Books, New York, NY, herein incorporated by 
reference) a multitude of GTS nucleotide sequences, some bearing minimal nucleotide 
sequence homology to the nucleotide sequence of genes naturally encoding GTS peptides, 
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can be produced. The invention has specifically contemplated each and every possible 
variation of nucleotide sequence that could be made by selecting combinations based on 
possible codon choices. These combinations are made in accordance with the standard triplet 
genetic code as applied to the nucleotide sequence of naturally occurring human GTS 
nucleotide sequences and all such variations are to be considered as being specifically 
disclosed. Once the triplet codons are "translated" (which can be done electronically) into 
their amino acid counterparts, the amino acid sequences encoded by the GTS ORFs 
effectively represent a generic representation of the various nucleotide sequences that can 
encode the amino acid sequence (i.e., each amino acid is generic for the various nucleotide 
codons that correspond to that amino acid). 

The presently described novel human GTSs provide unique tools for diagnostic gene 
expression analysis, for cross species hybridization analysis, for genetic manipulations using 
a variety of techniques, like, for example, antisense inhibition, gene targeting, the 
identification or generation of full-length cDNA, mapping exons in the human genome, 
identifying exon splice junctions, gene therapy, gene delivery, chromosome mapping, etc. 
Furthermore, the expression-based detection and isolation of the described novel 
polynucleotides verifies that the genes encoding these sequences have not been inactivated 
by, for example, the covalent modification (methylation, acetylation, glycosylation, etc.) of 
the target cell genome, or inhibiting the function of transcriptional control elements. The fact 
that the genes have not been inactivated in the target cell genome can indicate an involvement 
in cellular metabolism, catabolism, homeostasis, or any of a wide variety of developmental 
and cell differentiation processes or the regulation of physiological or endocrine functions in 
the body, etc. (although treating the target cell with, for example, histone deacetylators can 
partially compensate for such inactivation and expand the target size of a given trapping 
construct). These data are especially useful when correlated with cDNA data from 
differentiated tissues and/or cells or cell lines in order to determine whether the absence of 
expression is regulated at the level of transcription or gene inactivation. 
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5.1 POLYNUCLEOTIDES OF THE PRESENT INVENTION 

The nucleotide sequences of the various isolated human GTSs of the present invention 
appear in the Sequence Listing as SEQ ID NOS:9-503. Additional embodiments of the 
present invention are GTS variants, or homologs, paralogs, orthologs, etc., which include 
isolated polynucleotides, or complements thereof, that hybridize to one or more of the 
disclosed GTSs of SEQ ID NOS:9-503 under stringent, or preferably highly stringent, 
conditions. By way of example and not limitation, high stringency hybridization conditions 
can be defined as follows: Prehybridization of filters containing DNA to be screened is 
carried out for 8 h to overnight at 65 °C in a buffer containing 6X SSC, 50mM Tris-HCl (pH 
7.5), ImM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, and 500 peg/ml denatured salmon 
sperm DNA. Filters are hybridized for 48 h at 65 °C in prehybridization mixture containing 
100yug/ml denatured salmon sperm DNA and 5-20 x 10 6 cpm of 32 P-labeled probe 
(alternatively, as in all hybridizations described herein, approximately 42, 44, 46, 48, 50, 52, 
54, 56, 58, 62, 64, 66, 68, 70, or about 72 degrees or more can be used). The filters are then 
washed in approximately IX wash mix (10X wash mix contains 3M NaCl, 0.6M Tris base, 
and 0.02M EDTA, alternatively, as with all washes described herein, 2X, 3X, 4X, 5X, 6X 
wash mix, or more, can be used) twice for 5 minutes each at room temperature, then in IX 
wash mix containing 1% SDS at 60°C (alternatively, as in all washes described herein, 
approximately 42, 44, 46, 48, 50, 52, 54, 56, 58, 62, 64, 66, 68, 70, or about 72 degrees or 
more can be used) for about 30 min, and finally in 0.3X wash mix (alternatively, as in all 
final washes described herein, approximately, 0.2X, 0.4X, 0.6X, 0.8X, IX, or any 
concentration between about 2X and about 6X can be used in conjunction with a suitable 
wash temperature) containing 0.1% SDS at 60°C (alternatively, approximately 42, 44, 46, 48, 
50, 52, 54, 56, 58, 62, 64, 66, 68, 70, or about 72 degrees or more can be used) for about 30 
min. The filters are then air dried and exposed to x-ray film for autoradiography. In an 
alternative protocol, washing of filters is done at 37 °C for 1 h in a solution containing 2X 
SSC, 0.01% PVP, 0.01% Ficoll, and 0.01% BSA. This is followed by a wash in 0.1X SSC at 
50°C for 45 min before autoradiography. Another example of hybridization under highly 
stringent conditions is hybridization to filter-bound DNA in 0.5 M NaHP0 4 , 7% sodium 
dodecyl sulfate (SDS), 1 mM EDTA at 65 °C, and washing in 0.1xSSC/0.1% SDS at 68 °C 
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(Ausubel F.M. et al, eds., 1989, Current Protocols in Molecular Biology, Vol. I, Green 
Publishing Associates, Inc., and John Wiley & sons, Inc., New York, at p. 2.10.3). 

Preferably, such GTS variants will encode at least a portion or domain of a, preferably 
naturally occurring, protein or polypeptide that encodes a functional equivalent to a protein or 
polypeptide, or portion or domain thereof, encoded by the disclosed GTSs. Additional 
examples of GTS variants include polynucleotides, or complements thereof, that are capable 
of binding to the disclosed GTSs under less stringent conditions, such as moderately stringent 
conditions, (e.g., washing in 0.2xSSC/0.1% SDS at 42° C (Ausubel et al, 1989, supra). 
Moderately stringent conditions can be additionally defined, for example, as follows: Filters 
containing DNA are pretreated for 6 h at 55 °C in a solution containing 6X SSC, 5X 
Denhart's solution, 0.5% SDS and 100 ug/ml denatured salmon sperm DNA. Hybridizations 
are carried out in the same solution and 5-20 x 10 6 cpm 32 P-labeled probe is used. Filters are 
incubated in hybridization mixture for 18-20 h at 55 °C (alternatively, as in all hybridizations 
described herein, approximately 42, 44, 46, 48, 50, 52, 54, 56, 58, 62, 64, 66, 68, 70, or about 
72 degrees or more can be used in combination with a suitable concentration of salt). The 
filters are then washed in approximately IX wash mix (10X wash mix contains 3M NaCl, 
0.6M Tris base, and 0.02M EDTA, alternatively, as with all washes described herein, 2X, 3X, 
4X, 5X, 6X wash mix, or more, can be used) twice for 5 minutes each at room temperature, 
then in IX wash mix containing 1% SDS at 60 °C (alternatively, as in all washes described 
herein, approximately, 42, 44, 46, 48, 50, 52, 54, 56, 58, 62, 64, 66, 68, 70, or about 72 
degrees or more can be used) for about 30 min, and finally in 0.3X wash mix (alternatively, 
as in all final washes described herein approximately 0.2X, 0.4X, 0.6X, 0.8X, IX, or any 
concentration between about 2X and about 6X can be used in conjunction with a suitable 
wash temperature) containing 0.1% SDS at 60°C (alternatively, approximately 42, 44, 45, 48, 
50, 52, 54, 56, 58, 62, 64, 66, 68, 70, or about 72 degrees or more can be used) for about 30 
min. The filters are then air dried and exposed to x-ray film for autoradiography. 

In an alternative protocol, washing of filters is done twice for 30 minutes at 60°C in a 
solution containing IX SSC and 0.1% SDS. Filters are blotted dry and exposed for 
autoradiography. 

Other conditions of moderate stringency which may be used are well-known in the art. 
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For example, washing of filters can be done at 37°C for 1 h in a solution containing 2X SSC, 
0.1% SDS. Another example of hybridization under moderately stringent conditions is 
washing in 0.2xSSC/0.1% SDS at 42°C (Ausubel et al., 1989, supra). Such less stringent 
conditions may also be, for example, low stringency hybridization conditions. By way of 
5 example and not limitation, procedures using such conditions of low stringency are as follows 
(see also Shilo and Weinberg, 1981, Proc. Natl. Acad. Sci. USA 78:6789-6792): Filters 
containing DNA are pretreated for 6 h at 40°C in a solution containing 35% formamide, 5X 
SSC, 50mM Tris-HCl (pH 7.5), 5mM EDTA, 0.1% PVP, 0.1% Ficoll, 1% BSA, and 500 
Aig/ml denatured salmon sperm DNA. Hybridizations are carried out in the same solution 

10 with the following modifications: 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100Mg/ml salmon 
sperm DNA, 10% (wt/vol) dextran sulfate, and 5-20 X 10 6 cpm 32 P-labeled probe is used. 
Filters are incubated in hybridization mixture for 18-20 h at 40°C (alternatively, as in all 
hybridizations described herein, approximately 42, 44, 46, 48, 50, 52, 54, 56, 58, 62, 64, 66, 
68, 70, or about 72 degrees or more can be used). The filters are then washed in 

1 5 approximately IX wash mix (lOx wash mix contains 3M NaCl, 0.6M Tris base, and 0.02M 
EDTA, alternatively, as with all washes described herein, 2X, 3X, 4X, 5X, 6X wash mix, or 
more, can be used) twice for five minutes each at room temperature, then in IX wash mix 
containing 1% SDS at 60°C (alternatively, as in all washes described herein, approximately 
42, 44, 46, 48, 50, 52, 54, 56, 58, 62, 64, 66, 68, 70, or about 72 degrees or more can be used) 

20 for about 30 min, and finally in 0.3X wash mix (alternatively, as in all final washes described 
herein, approximately, 0.2X, 0.4X, 0.6X, 0.8X, IX, or any concentration between about 2X 
and about 6X can be used in conjunction with a suitable wash temperature) containing 0.1% 
SDS at 60°C (alternatively, approximately 42, 44, 46, 48, 50, 52, 54, 56, 58, 62, 64, 66, 68, 
70, or about 72 degrees or more can be used) for about 30 min. The filters are then air dried 

25 and exposed to x-ray film for autoradiography. In yet another alternative protocol, washing 
of filters is done for 1.5 h at 55 °C in a solution containing 2X SSC, 25mM Tris-HCl (pH 
7.4), 5mM EDTA, and 0.1% SDS. The wash solution is replaced with fresh solution and 
incubated an additional 1 .5 h at 60 °C. Filters are then blotted dry and exposed for 
autoradiography. If necessary, filters are washed for a third time at 65-68 °C and reexposed to 

30 film. Other conditions of low stringency which may be used are well known in the art {e.g., 
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as employed for cross-species hybridizations). Preferably, GTS variants identified or isolated 
using the above methods will also encode a functionally equivalent gene product (i.e., 
protein, polypeptide, or domain thereof, encoding or otherwise associated with a function or 
structure at least partially encoded by the complementary GTS). 

Additional embodiments contemplated by the present invention include any 
polynucleotide sequence comprising a continuous stretch of nucleotide sequence originally 
disclosed in, or otherwise unique to, any of the GTSs of SEQ ID NOS:9-503 that are at least 
8, or at least 10, or at least 14, or at least 20, or at least 30, or at least about 40, and preferably 
at least about 60 consecutive nucleotides up to about several hundred bases of nucleotide 
sequence or an entire GTS sequence. Functional equivalents of the gene products of SEQ ID 
NOS:9-503 include naturally occurring variants of SEQ ID NOS:9-503 present in other 
species, and mutant variants, both naturally occurring and engineered, which retain at least 
some of the functional activities of the gene products of SEQ ID NOS:9-503. 

The invention also includes degenerate variants of the claimed GTS sequences, and 
products encoded thereby. Such variants may be 80% identical to any one of SEQ ID NOS: 
9-503, more preferably 85%, more preferably 90%, more preferably 95% and most preferably 
98% identical. The degree of identity (or the degree of homology) of a polynucleotide 
sequence to any one of SEQ ID NOS: 9-503 may be determined using any sequence analysis 
program known in the art, for example, the University of Wisconsin GCG sequence analysis 
package, SEQUENCHER 3.0, Gene Codes Corp., Ann Arbor, MI. The invention further 
includes GTS derivatives wherein any of the disclosed GTSs, or GTS variants, is linked to 
another polynucleotide molecule, or a fragment thereof, wherein the link may be either 
directly or through other polynucleotides of any sequence and of a length of about 1,000 base 
pairs, or about 500 base pairs, or about 300 base pairs, or about 200 base pairs, or about 150 
base pairs, or about 100 base pairs or about 50 base pairs, or less. 

The invention also particularly includes polynucleotide molecules, including DNA, 
that hybridize to, and are therefore the complements of, the nucleotide sequences of the 
disclosed GTSs. Such hybridization conditions may be highly stringent or less highly 
stringent, as described above. In instances wherein the nucleic acid molecules are 
deoxyoligonucleotides ("DNA oligos"), highly stringent conditions may refer to, for example, 
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washing in 6xSSC/0.05% sodium pyrophosphate at 37° C (for oligos having 14-base DNA 
oligos), 48° C (for 17-base DNA oligos), 55° C (for 20-base DNA oligos), and 60°C (for 23- 
base oligos). Similar conditions are contemplated for RNA oligos corresponding to a portion 
of the disclosed GTS sequences. 

These nucleic acid molecules may encode or act as antisense molecules to 
polynucleotides comprising at least a portion of the sequences shown in SEQ ID NOS:9-503 
that are useful, for example, to regulate the expression of genes comprising a nucleotide 
sequence of any of SEQ ID NOS:9-503, and can also be used, for example, as antisense 
primers in amplification reactions of gene sequences. With respect to gene regulation, such 
techniques can be used to regulate, for example, developmental processes by modulating the 
expression of genes in embryonic stem cells. Further, such sequences may be used as part of 
ribozyme and/or triple helix sequences that can be used to regulate gene expression. Still 
further, such molecules may be used as components of diagnostic methods whereby, for 
example, the presence of a particular allele, of a gene that contains any of the sequences of 
SEQ ID NOS:9-503 may be detected. Of particular interest is the use of the disclosed GTSs 
to conduct analysis of single nucleotide polymorphisms (SNPs), and particularly coding 
region SNPs or "cSNPs", in the human genome, or as general or individual-specific forensic 
markers. When so applied, a "collection of GTSs is obtained from an individual, and screened 
against a control database of cSNPs (or other genetic markers) that have previously been 
associated with disease, suitability or susceptibility (or sensitivity) to specific drugs or 
therapies, or virtually any other human trait that correlates with a given cSNP or genetic 
marker, or assortment thereof. In addition to disease/diagnostic testing, the described GTSs 
are also useful as genetic markers for the prenatal analysis of congenital traits or defects. 

In addition to the nucleotide sequences described above, full length cDNA or gene 
sequences that contain any of SEQ ID NOS:9-503 present in the same species and/or 
homologs of any of those genes present in other species can be identified and isolated by 
using molecular biological techniques known in the art. 

In order to clone the full length cDNA sequence from any species encoding the cDNA 
corresponding to the entire messenger RNA or to clone variant or heterologous forms of the 
molecule, labeled DNA probes made from nucleic acid fragments corresponding to any of the 
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partial cDNA disclosed herein may be used to screen a cDNA library. For example, 
oligonucleotides corresponding to either the 5' or 3' terminus of the cDNA sequence may be 
used to obtain longer nucleotide sequences. Briefly, the library may be plated out to yield a 
maximum of about 30,000 pfu for each 150 mm plate. Approximately 40 plates may be 
screened. The plates are incubated at 37° C until the plaques reach a diameter of 0.25 mm or 
are just beginning to make contact with one another (3-8 hours). Nylon filters are placed onto 
the soft top agarose and after 60 seconds, the filters are peeled off and floated on a DNA 
denaturing solution consisting of 0.4N sodium hydroxide. The filters are then immersed in 
neutralizing solution consisting of 1 M Tris HC1, pH 7.5, before being allowed to air dry. 
The filters are prehybridized in casein hybridization buffer containing 10% dextran sulfate, 
0.5 M NaCl, 50 mM Tris HCL, pH 7.5, 0.1% sodium pyrophosphate, 1% casein, 1% SDS, 
and denatured salmon sperm DNA at 0.5 mg/ml for 6 hours at 60° C. The radiolabeled 
probe is then denatured by heating to 95° C for 2 minutes and then added to the 
prehybridization solution containing the filters. The filters are hybridized at 60° C 
(alternatively, as in all hybridizations described herein, approximately 42, 44, 46, 48, 50. 52, 
54, 56, 58, 62, 64, 66, 68, 70, or about 72 degrees or more can be used) for about 16 hours. 
The filters are then washed in approximately IX wash mix (10X wash mix contains 3M 
NaCl, 0.6M Tris base, and 0.02M EDTA, alternatively, as with all washes described herein, 
2X, 3X, 4X, 5X, 6X wash mix, or more, can be used) twice for 5 minutes each at room 
temperature, then in IX wash mix containing 1% SDS at 60° C (alternatively, as in all 
washes described herein, approximately 42, 44, 46, 48, 50. 52, 54, 56, 58, 62, 64, 66, 68, 70, 
or about 72 degrees or more can be used) for about 30 min, and finally in 0.3X wash mix 
(alternatively, as in all final washes described herein, approximately, 0.2X, 0.4X, 0.6X, 0.8X, 
IX, or any concentration between about 2X and about 6X can be used in conjunction with a 
suitable wash temperature) containing 0.1% SDS at 60° C (alternatively, approximately 42, 
44, 46, 48, 50. 52, 54, 56, 58, 62, 64, 66, 68, 70, or about 72 degrees or more can be used) for 
about 30 min. The filters are then air dried and exposed to x-ray film for autoradiography. 
After developing, the film is aligned with the filters to select a positive plaque. If a single, 
isolated positive plaque cannot be obtained, the agar plug containing the plaques will be 
removed and placed in lambda dilution buffer containing 0.1M NaCl, 0.01M magnesium 
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sulfate, 0.O35M Tris HC1, pH 7.5, 0.01% gelatin. The phage may then be replated and 
rescreened to obtain single, well isolated positive plaques. Positive plaques may be isolated 
and the cDNA clones sequenced using primers based on the known cDNA sequence. This 
step may be repeated until a full length cDNA is obtained. 

It may be necessary to screen multiple cDNA libraries from different sources/tissues 
to obtain a full length cDNA. In the event that it is difficult to identify cDNA clones 
encoding the complete 5' terminal coding region, an often encountered situation in cDNA 
cloning, the RACE (Rapid Amplification of cDNA Ends) technique may be used. RACE is a 
proven PCR-based strategy for amplifying the 5' end of incomplete cDNAs. 5'-RACE-Ready 
cDNA synthesized from human fetal liver containing a unique anchor sequence is 
commercially available (Clontech). To obtain the 5' end of the cDNA, PCR is carried out, for 
example, on 5'-RACE-Ready cDNA using the provided anchor primer and the 3' primer. A 
secondary PCR reaction is then carried out using the anchored primer and a nested 3' primer 
according to the manufacturer's instructions. 

Once obtained, the full length cDNA sequence may be translated into amino acid 
sequence and examined for certain landmarks found in the amino acid sequences encoded by 
SEQ ID NOS:9-503, or any structural similarities to these disclosed sequences. 

The identification of homologs, heterologs, or paralogs of SEQ ID NOS:9-503 in 
other, preferably related, species can be useful for developing additional animal model 
systems that are closely related to humans for purposes of drug discovery. Genes at other 
genetic loci within the genome that encode proteins which have extensive homology to one or 
more domains of the gene products encoded by SEQ ID NOS:9-503 can also be identified via 
similar techniques. In the case of cDNA libraries, such screening techniques can identify 
clones derived from alternatively spliced transcripts in the same or different species. 

Screening can be done using filter hybridization with duplicate filters. The labeled 
probe can contain at least 15-30 base pairs of the nucleotide sequence presented in SEQ ID 
NOS:9-503. The hybridization washing conditions used should be of a lower stringency 
when the cDNA library is derived from an organism different from, or heterologous to, the 
type of organism from which the labeled sequence was derived. With respect to the cloning 
of a mammalian homolog, heterolog, ortholog, or paralog, using probes derived from any of 
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the sequences of SEQ ID NOS:9-503, for example, hybridization can, for example, be 
performed at 65° C overnight in Church's buffer (7% SDS, 250 mM NaHP0 4 , 2 mM EDTA, 
1% BSA). Washes can be done with 2XSSC, 0.1% SDS at 65° C and then at 0.1XSSC, 0.1% 
SDS at 65° C. 

Low stringency conditions are well known to those of skill in the art, and will vary 
predictably depending on the specific organisms from which the library and the labeled 
sequences are derived. For guidance regarding such conditions see, for example, Sambrook 
et al, 1989, Molecular Cloning, A Laboratory Manual, Cold Springs Harbor Press, N.Y.; and 
Ausubel et al, 1989, Current Protocols in Molecular Biology, Green Publishing Associates 
and Wiley Interscience, N.Y. 

Alternatively, the labeled nucleotide probe of a sequence of any of SEQ ID NOS:9- 
503 may be used to screen a genomic library derived from the organism of interest, again, 
using appropriately stringent conditions. The identification and characterization of human 
genomic clones is helpful for designing diagnostic tests and clinical protocols for treating 
disorders in human patients that are known or suspected to be linked to disease or other 
developmental or cell differentiation disorders and abnormalities. For example, sequences 
derived from regions adjacent to the intron/exon boundaries of the human gene can be used to 
design primers for use in amplification assays to detect mutations within the exons, introns, 
splice sites (e.g. , splice acceptor and/or donor sites), etc., that can be used in diagnostics. 

Further, gene homologs can also be isolated from nucleic acid of the organism of 
interest by performing PCR using two oligonucleotide primers derived from SEQ ID NOS:9- 
503 or two degenerate oligonucleotide primer pools designed on the basis of amino acid 
sequences within the gene products encoded by SEQ ID NOS:9-503. The template for the 
reaction may be cDNA obtained by reverse transcription of mRNA prepared from, for 
example, human or non-human cell lines, cell types, or tissues, like, for example, ES cells 
from the organism of interest. 

The PCR product may be subcloned or sequenced directly or subcloned and 
sequenced to ensure that the amplified sequences represent the sequences of the gene 
corresponding to the sequence of SEQ ID NOS:9-503 of interest. The PCR fragment may 
then be used to isolate a full length cDNA clone by a variety of methods. For example, the 
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amplified fragment may be labeled and used to screen a cDNA library, such as a 
bacteriophage cDNA library. Alternatively, the labeled fragment may be used to isolate 
genomic clones via the screening of a genomic library. 

PCR technology may also be utilized to isolate full length cDNA sequences. For 
example, RNA can be isolated using standard procedures from an appropriate cellular source 
(i.e., one known, or suspected, to express the gene corresponding to the sequence of SEQ ID 
NOS:9-503 of interest, such as, for example, ES cells). A reverse transcription reaction may 
be performed on the RNA using an oligonucleotide primer specific for the most 5' end of the 
amplified fragment for the priming of first strand synthesis. The resulting RNA/DNA hybrid 
may then be "tailed" with guanines, for example, using a standard terminal transferase 
reaction, the hybrid may be digested with RNase H, and second strand synthesis may then be 
primed with a poly-C primer. Thus, cDNA sequences upstream from the amplified fragment 
may easily be isolated. For a review of cloning strategies which may be used, see e.g., 
Sambrook et al, 1989, supra . Alternatively, cDNA or genomic libraries can be screened 
using 5' PCR primers that hybridize to vector sequences and 3' PCR primers specific to the 
gene of interest. Typically, such primers comprise oligonucleotide "priming" sequences first 
disclosed in, or otherwise unique to, one of the GTSs of SEQ ID NOS:9-503. 

The sequence of a gene corresponding to any of the sequences of SEQ ID NOS:9-503 
can also be used to isolate mutant alleles of that gene. Such mutant alleles may be isolated 
from individuals either known or suspected to have a genotype which contributes to the 
disease of interest or other symptoms of developmental and cell differentiation and/or 
proliferation disorders and abnormalities. Mutant alleles and mutant allele products may then 
be utilized in the therapeutic and diagnostic programs described below. Additionally, such 
sequences of any of the genes corresponding to SEQ ID NOS:9-503 can be used to detect 
gene regulatory (e.g., promoter or promoter/enchancer) defects which can affect development 
or cell differentiation. 

A cDNA of a mutant gene corresponding to any of the sequences of SEQ ID NOS:9- 
503 can be isolated as discussed above, or, for example, by using PCR. In this case, the first 
cDNA strand may be synthesized by hybridizing an oligo-dT oligonucleotide to mRNA 
isolated from cells derived from an individual suspected of carrying a mutant gene 
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corresponding to any of the sequences of SEQ ID NOS:9-503 by extending the new strand 
with reverse transcriptase. The second strand of the cDNA is then synthesized using an 
oligonucleotide that hybridizes specifically to the 5' region of the normal gene. The amplified 
product can be directly sequenced or cloned into a suitable vector and subsequently subjected 
5 to DNA sequence analysis. By comparing the DNA sequence of the mutant allele to that of 
the normal allele, the mutation(s) responsible for the loss or alteration of function of the 
mutant gene product can be ascertained. 

Alternatively, a genomic library can be constructed using DNA obtained from one or 
more individuals suspected of carrying, or known to carry, a mutant allele corresponding to 

10 any of SEQ ID NOS:9-503. Corresponding mutant cDNA libraries can be also constructed 
using RNA from cell types known, or suspected, to express such mutant alleles. The 
corresponding normal gene, or any suitable fragment thereof, may then be labeled and used as 
a probe to identify the corresponding mutant allele in such libraries. Clones containing the 
mutant gene sequences may then be identified and analyzed by DNA sequence analysis. 

15 Additionally, a protein expression library can be constructed utilizing cDNA synthesized 
from, for example, RNA isolated from a cell type known, or suspected, to express a mutant 
allele corresponding to any of the sequences of SEQ ID NOS:9-503 from an individual 
suspected of, carrying or known to carry, such a mutant allele. In this manner, gene products 
made by the putatively mutant cell type may be expressed and screened using standard 

20 antibody screening techniques in conjunction with antibodies raised against the 

corresponding normal gene product or a portion thereof, as described below in Section 5.4 
(For screening techniques, see, for example, Harlow, E. and Lane, eds., 1988, "Antibodies: A 
Laboratory Manual", Cold Spring Harbor Press, Cold Spring Harbor.) Additionally, 
screening can be accomplished by screening with labeled fusion proteins. In cases where a 

25 mutation results in an expressed gene product with altered function (e.g. , as a result of a 
missense or a frame shift mutation), a polyclonal set of antibodies to the wild-type gene 
product are likely to cross-react with the mutant gene product. Library clones detected via 
their reaction with such labeled antibodies can be purified and subjected to sequence analysis 
according to methods well known to those of skill in the art. 
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The invention also encompasses nucleotide sequences that encode mutant isoforms of 
any of the amino acid sequences encoded by the GTSs of SEQ ID NOS:9-503, peptide 
fragments thereof, truncated versions thereof, and fusion proteins including any of the above. 
Examples of such fusion proteins can include, but not limited to, an epitope tag which aids in 
5 purification or detection of the resulting fusion protein; or an enzyme, fluorescent protein, 
luminescent protein which can be used as a marker. 

The present invention additionally encompasses (a) RNA or DNA vectors that contain 
any portion of SEQ ID NOS:9-503 and/or their complements as well as any of the peptides or 
proteins encoded thereby; (b) DNA vectors that contain a cDNA that substantially spans the 

10 entire open reading frame corresponding to any of the sequences of SEQ ID NOS:9-503 
and/or their complements; (c) DNA expression vectors that have or contain any of the 
foregoing sequences, or a portion thereof, operatively associated with a (d) genetically 
engineered host cells that contain a cDNA that spans the entire open reading frame, or any 
portion thereof, corresponding to any of the sequences of SEQ ID NOS:9-503 operatively 

15 associated with a regulatory element, generally recombinantly positioned either in vivo (such 
as in gene activation) or in vitro that directs the expression of the coding sequences in the 
host cell. As used herein, regulatory elements include, but are not limited to, inducible and 
non-inducible promoters, enhancers, operators and other elements known to those skilled in 
the art that drive and regulate expression. Such regulatory elements include, but are not 

20 limited to,the baculovirus promoter, cytomegalovirus hCMV immediate early gene promoter, 
the early or late promoters of SV40 adenovirus, the lac system, the trp system, the TAC 
system, the TRC system, the major operator and promoter regions of phage A, the control 
regions of fd coat protein, acid phosphatase promoters, phosphoglycerate kinase (PGK) and 
especially 3 -phosphoglycerate kinase promoters, and yeast alpha mating factors. 

25 An additional application of the described novel human polynucleotide sequences is 

their use in the molecular mutagenesis/evolution of proteins that are at least partially encoded 
by the described novel sequences using, for example, polynucleotide shuffling or related 
methodologies. Such approaches are described in U.S. Patents Nos. 5,830,721 and 5,837,458 
which are herein incorporated by reference in their entirety. 

30 
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5.2 PROTEINS AND POLYPEPTIDES ENCODED BY POLYNUCLEOTIDES 
EXPRESSED IN MODIFIED HUMAN CELLS 



Peptides and proteins encoded by the open reading frame of mRNAs corresponding to 
SEQ ID NOS:9-503, polypeptides and peptide fragments, mutated, truncated or deleted forms 
of those peptides and proteins, fusion proteins containing any of those peptides and proteins 
can be prepared for a variety of uses, including, but not limited to, the generation of 
antibodies, as reagents in diagnostic assays, the identification of other cellular gene products 
involved in the regulation of development and cellular differentiation of various cell types, 
like, for example, ES cells, as reagents in assays for screening for compounds that can be 
used in the treatment of disorders affecting development and cell differentiation, and as 
pharmaceutical reagents useful in the treatment of disorders affecting development and cell 
differentiation. 

The invention also encompasses proteins, peptides, and polypeptides that are 
functionally equivalent to those encoded by SEQ ID NOS:9-503. Such functionally 
equivalent products include, but are not limited to, additions or substitutions of amino acid 
residues within the amino acid sequence encoded by the nucleotide sequences described 
above, but which result in a silent change, thus producing a functionally equivalent gene 
product. Amino acid substitutions can be made on the basis of similarity in polarity, charge, 
solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues 
involved. For example, nonpolar (hydrophobic) amino acids include alanine, leucine, 
isoleucine, valine, proline, phenylalanine, tryptophan, and methionine; polar neutral amino 
acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine; 
positively charged (basic) amino acids include arginine, lysine, and histidine; and negatively 
charged (acidic) amino acids include aspartic acid and glutamic acid. 

While random mutations can be introduced into DNA encoding peptides and proteins 
of the current invention (using random mutagenesis techniques well known to those skilled in 
the art), and the resulting mutant peptides and proteins tested for activity, site-directed 
mutations of the coding sequence can be engineered (using standard site-directed mutagenesis 
techniques) to generate mutant peptides and proteins of the current invention having 
increased functionality. 
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For example, the amino acid sequence of peptides and proteins of the current 
invention can be aligned with homologs from different species. Mutant peptides and proteins 
can be engineered so that regions of interspecies identity are maintained, whereas the variable 
residues are altered, e.g., by deletion or insertion of an amino acid residue(s) or by 
5 substitution of one or more different amino acid residues. Conservative alterations at the 
variable positions can be engineered in order to produce a mutant form of a peptide or protein 
of the current invention that retains function. Non-conservative changes can be engineered at 
these variable positions to alter function. Alternatively, where alteration of function is 
desired, deletion or non-conservative alterations of the conserved regions can be engineered. 

10 One of skill in the art may easily test such mutant or deleted form of a peptide or protein of 
the current invention for these alterations in function using the teachings presented herein. 

Other mutations to the coding sequences described above can be made to generate 
peptides and proteins that are better suited for expression, scale up, etc. in the host cells 
chosen. For example, the triplet code for each amino acid can be modified to conform more 

1 5 closely to the preferential codon usage of the host cell's translational machinery, or, for 

example, to yield a messenger RNA molecule with a longer half-life. Those skilled in the art 
would readily know what modifications of the nucleotide sequence would be desirable to 
conform the nucleotide sequence to preferential codon usage or to make the messenger RNA 
more stable. Such information would be obtainable, for example, through use of computer 

20 programs, through review of available research data on codon usage and messenger RNA 
stability, and through other means known to those of skill in the art. 

Peptides corresponding to one or more domains (or a portion of a domain) of one of 
the proteins described above, truncated or deleted proteins, as well as fusion proteins in which 
the full length protein described above, a subunit peptide or truncated version is fused to an 

25 unrelated protein are also within the scope of the invention and can be designed by those of 
skill in the art on the basis of experimental or functional considerations. Such fusion proteins 
include, but are not limited to, fusions to an epitope tag; or fusions to an enzyme, fluorescent 
protein, or luminescent protein which provide a marker function. 

While the peptides and proteins of the current invention can be chemically 

30 synthesized (e.g. , see Creighton, 1 983, Proteins: Structures and Molecular Principles, W.H. 
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Freeman & Co., N.Y.), large polypeptides derived from any of the polynucleotides described 
above may advantageously be produced by recombinant DNA technology using techniques 
well known in the art for expressing genes and/or coding sequences. These methods include, 
for example, in vitro recombinant DNA techniques, synthetic techniques, and in vivo genetic 
5 recombination. See, for example, the techniques described in Sambrook et al, 1989, supra, 
and Ausubel et al, 1989, supra. Alternatively, RNA capable of encoding any of the 
nucleotide sequences described above may be chemically synthesized using, for example, 
synthesizers. See, for example, the techniques described in "Oligonucleotide Synthesis", 
1984, Gait, M.J. ed., IRL Press, Oxford, which is incorporated by reference herein in its 
10 entirety. 

A variety of host-expression vector systems may be utilized to express the nucleotide 
sequences of the invention. Where the peptide or protein to be synthesized is a soluble 
derivative, the peptide or polypeptide can be recovered from the culture, i.e., from the host 
cell in cases where the peptide or polypeptide is not secreted, and from the culture media in 

1 5 cases where the peptide or polypeptide is secreted by the cells. However, such engineered 
host cells themselves may be used in situations where it is important not only to retain the 
structural and functional characteristics of the expressed peptide or protein, but to assess 
biological activity, e.g., in drug screening assays. 

The expression systems that may be used for purposes of the invention include, but 

20 are not limited to, microorganisms such as bacteria (e.g., E. coli, B. subtilis) transformed with 
recombinant bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors 
containing a nucleotide sequence of the current invention; yeast (e.g., Saccharomyces, 
Pichia) transformed with recombinant yeast expression vectors containing a nucleotide 
sequence of the current invention; insect cell systems infected with recombinant virus expres- 

25 sion vectors (e.g. , baculovirus) containing a nucleotide sequence of the current invention; 
plant cell systems infected with recombinant virus expression vectors (e.g., cauliflower 
mosaic virus, CaMV; tobacco mosaic virus, TMV) or transformed with recombinant plasmid 
expression vectors (e.g., Ti plasmid) containing a nucleotide sequence of the current 
invention; or mammalian cell systems (e.g., COS, CHO, BHK, 293, 3T3, U937) harboring 

30 recombinant expression constructs containing promoters derived from the genome of 
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mammalian cells {e.g., metallothionein promoter) or from mammalian viruses {e.g., the 
adenovirus late promoter; the vaccinia virus 7.5K promoter). 

In bacterial systems, a number of expression vectors may be advantageously selected 
depending upon the use intended for the gene product being expressed. For example, when 
5 large quantities of such a protein are to be produced for the generation of pharmaceutical 
compositions of a protein or for raising antibodies to the protein to be expressed, for example, 
vectors which direct the expression of high levels of fusion protein products that are readily 
purified may be desirable. Such vectors include, but are not limited to, the E. coli expression 
vector pUR278 (Ruther et al, 1983, EMBO J. 2:1791), in which the coding sequence of the 

1 0 polynucleotide to be expressed may be ligated individually into the vector in frame with the 
lacZ coding region so that a fusion protein is produced; pIN vectors (Inouye & Inouye, 1985, 
Nucleic Acids Res. 73:3101-3109; Van Heeke & Schuster, 1989, J. Biol. Chem. 264:5503- 
5509); and the like. pGEX vectors may also be used to express foreign polypeptides as 
fusion proteins with glutathione S-transferase (GST). If the inserted sequence encodes a 

15 relatively small polypeptide (less than 25 kD), such fusion proteins are generally soluble and 
can easily be purified from lysed cells by adsorption to glutathione-agarose beads followed 
by elution in the presence of free glutathione. The pGEX vectors are designed to include 
thrombin or factor Xa protease cleavage sites so that the cloned target gene product can be 
released from the GST moiety. Alternatively, if the resulting fusion protein is insoluble and 

20 forms inclusion bodies in the host cell, the inclusion bodies may be purified and the 
recombinant protein solubilized using techniques well known to one of skill in the art. 

In an insect system, Autographa californica nuclear polyhidrosis virus (AcNPV) may 
be used as a vector to express foreign genes, {e.g. , see Smith et al, 1983, J. Virol. 46: 584; 
Smith, U.S. Patent No. 4,215,051). In one embodiment of the current invention, Sf9 insect 

25 cells are infected with a baculovirus vector expressing a peptide or protein of the current 
invention. 

In mammalian host cells, a number of viral-based expression systems may be utilized. 
Specific embodiments (described more fully below) include the gene trap cDNA sequences of 
the current invention that are expressed by a CMV promoter to transiently express 
30 recombinant protein in U937 cells or in Cos-7 cells. Alternatively, retroviral vector systems 
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well known in the art may be used to insert the recombinant expression construct into host 
cells, or vaccinia virus-based expression systems may be employed. 

In yeast, a number of vectors containing constitutive or inducible promoters may be 
used. For a review, see Current Protocols in Molecular Biology, Vol. 2, 1988, Ed. Ausubel et 
5 al, Greene Publish. Assoc. & Wiley Interscience, Ch. 13; Grant et al, 1987, Expression and 
Secretion Vectors for Yeast, in Methods in Enzymology, Eds. Wu & Grossman, 1987, Acad. 
Press, N.Y., Vol. 153, pp. 516-544; Glover, 1986, DNA Cloning, Vol. II, IRL Press, Wash., 
D.C., Ch. 3; and Bitter, 1987, Heterologous Gene Expression in Yeast, Methods in 
Enzymology, Eds. Berger & Kimmel, Acad. Press, N.Y., Vol. 152, pp. 673-684; and The 

10 Molecular Biology of the Yeast Saccharomyces, 1982, Eds. Strathern et al, Cold Spring 
Harbor Press, Vols. I and II. 

In cases where plant expression vectors are used, the expression of the coding 
sequence may be driven by any of a number of promoters. For example, viral promoters such 
as the 35S RNA and 19S RNA promoters of CaMV (Brisson et al, 1984, Nature, 370:51 1- 

15 514), or the coat protein promoter of TMV (Takamatsu et al, 1987, EMBO J. 5:307-31 1) 
may be used; alternatively, plant promoters such as the small subunit of RUBISCO (Coruzzi 
etal, 1984, EMBO J. 5:1671-1680; Broglie etal, 1984, Science 224:838-843); or heat shock 
promoters, e.g., soybean hspl7.5-E or hspl7.3-B (Gurley etal, 1986, Mol. Cell. Biol. 6:559- 
565) may be used. These constructs can be introduced into plant cells using Ti plasmids, Ri 

20 plasmids, plant virus vectors, direct DNA transformation, microinjection, electroporation, etc. 
For reviews of such techniques see, for example, Weissbach & Weissbach, 1988, Methods for 
Plant Molecular Biology, Academic Press, NY, Section VIII, pp. 421-463; and Grierson & 
Corey, 1988, Plant Molecular Biology, 2d Ed., Blackie, London, Ch. 7-9. 

In cases where an adenovirus is used as an expression vector, the nucleotide sequence 

25 of interest may be ligated to an adenovirus transcription/translation control complex, e.g. , the 
late promoter and tripartite leader sequence. This chimeric gene may then be inserted in the 
adenovirus genome by in vitro or in vivo recombination. Insertion in a non-essential region 
of the viral genome (e.g., region El or E3) will result in a recombinant virus that is viable and 
capable of expressing the gene product of interest in infected hosts, (e.g., See Logan & 

30 Shenk, 1984, Proc. Natl. Acad. Sci. USA 57:3655-3659). Specific initiation signals may also 
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be required for efficient translation of inserted nucleotide sequences of interest. These signals 
include the ATG initiation codon and adjacent sequences. In cases where an entire gene or 
cDNA, including its own initiation codon and adjacent sequences, is inserted into the 
appropriate expression vector, no additional translational control signals may be needed. 
However, in cases where only a portion of a coding sequence of interest is inserted, 
exogenous translational control signals, including, perhaps, the ATG initiation codon, must 
be provided. Furthermore, the initiation codon must be in phase with the reading frame of the 
desired coding sequence to ensure translation of the entire insert. These exogenous 
translational control signals and initiation codons can be of a variety of origins, both natural 
and synthetic. The efficiency of expression may be enhanced by the inclusion of appropriate 
transcription enchanter elements, transcription terminators, etc. (See Bittner et al, 1987, 
Methods inEnzymol. 753:516-544). 

In addition, a host cell strain may be chosen which modulates the expression of the 
inserted sequences, or modifies and processes the gene product in the specific fashion 
desired. Such modifications (e.g., glycosylation) and processing (e.g., cleavage) of protein 
products may be important for the function of the protein. Different host cells have charac- 
teristic and specific mechanisms for the post-translational processing and modification of 
proteins and gene products. Appropriate cell lines or host systems can be chosen to ensure 
the correct modification and processing of the foreign protein expressed. To this end, 
eukaryotic host cells which possess the cellular machinery for proper processing of the 
primary transcript may be used. Such mammalian host cells include, but are not limited 
to,CHO, VERO, BHK, HeLa, COS, MDCK, 293, 3T3, WI38, and U937 cells. 

For long-term, high-yield production of recombinant proteins, stable expression is 
preferred. For example, cell lines which stably express the sequences of interest described 
above may be engineered. Rather than using expression vectors which contain viral origins 
of replication, host cells can be transformed with DNA controlled by appropriate expression 
control elements (e.g., promoter, enchanter sequences, transcription terminators, 
polyadenylation sites, etc.), and a selectable marker. Following the introduction of the 
foreign DNA, engineered cells may be allowed to grow for 1-2 days in an enriched media, 
and then are switched to a selective media. The selectable marker in the recombinant plasmid 
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confers resistance to the selection and allows cells to stably integrate the plasmid into their 
chromosomes and grow to form foci which in turn can be cloned and expanded into cell lines. 
This method may advantageously be used to engineer cell lines which express the gene 
product of interest. Such engineered cell lines may be particularly useful in screening and 
evaluation of compounds that affect the endogenous activity of the gene product of interest. 

A number of selection systems may be used, including but not limited to the herpes 
simplex virus thymidine kinase (Wigler et al, 1977, Cell 11:223), hypoxanthine-guanine 
phosphoribosyltransferase (Szybalska & Szybalski, 1962, Proc. Natl. Acad. Sci. USA 
48:2026), and adenine phosphoribosyltransferase (Lowy et al, 1980, Cell 22:817) genes can 
be employed in tkr, hgprt or aprt cells, respectively. Also, antimetabolite resistance can be 
used as the basis of selection for the following genes: dhfr, which confers resistance to 
methotrexate (Wigler et al, 1980, Natl. Acad. Sci. USA 77:3567; O'Hare et al, 1981, Proc. 
Natl. Acad. Sci. USA 75:1527); gpt, which confers resistance to mycophenolic acid 
(Mulligan & Berg, 1981, Proc. Natl. Acad. Sci. USA 75:2072); neo, which confers resistance 
to the aminoglycoside G-418 (Colberre-Garapin et al, 1981, J. Mol. Biol. 150:1); and hygro, 
which confers resistance to hygromycin (Santerre et al, 1984, Gene 50:147). 

The gene products of interest can also be expressed in transgenic animals. Animals of 
any species, including, but not limited to, mice, rats, rabbits, guinea pigs, pigs, micro-pigs, 
goats, and non-human primates, e.g., baboons, monkeys, and chimpanzees may be used to 
generate transgenic animals carrying the polynucleotide of interest of the current invention. 

Any technique known in the art may be used to introduce the transgene of interest into 
animals to produce the founder lines of transgenic animals. Such techniques include, but are 
not limited to pronuclear microinjection (Hoppe, P.C. and Wagner, T.E., 1989, U.S. Pat. No. 
4,873,191); retrovirus mediated gene transfer into germ lines (Van der Putten et al, 1985, 
Proc. Natl. Acad. Sci., USA 52:6148-6152); gene targeting in embryonic stem cells 
(Thompson et al, 1989, Cell 56:313-321); electroporation of embryos (Lo, 1983, Mol Cell. 
Biol. 3:1803-1814); sperm-mediated gene transfer (Lavitrano et al, 1989, Cell 57:717-723); 
positive-negative selection as described in U.S. Patent No. 5,464,764 herein incorporated by 
reference. For a review of such techniques, see Gordon, 1989, Transgenic Animals, Intl. Rev. 
Cytol. 775:171-229, which is incorporated by reference herein in its entirety. 
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The present invention provides for transgenic animals that carry the transgene of 
interest in all their cells, as well as animals which carry the transgene in some, but not all 
their cells, i.e., mosaic animals. The transgene may be integrated as a single transgene or in 
concatamers, e.g. , head-to-head tandems or head-to-tail tandems. The transgene may also be 
selectively introduced into and activated in a particular cell type by following, for example, 
the teaching of Lasko et al. (Lasko, M. et al, 1992, Proc. Natl. Acad. Sci. USA 89:6232- 
6236). The regulatory sequences required for such a cell-type specific activation will depend 
upon the particular cell type of interest, and will be apparent to those of skill in the art. When 
it is desired that the transgene of interest be integrated into the chromosomal site of the 
endogenous copy of that same gene, gene targeting is preferred. Briefly, when such a 
technique is to be utilized, vectors containing some nucleotide sequences homologous to the 
endogenous gene of interest are designed for the purpose of integrating, via homologous 
recombination with chromosomal sequences, into and disrupting the function of the 
nucleotide sequence of the endogenous gene of interest. In this way, the expression of the 
endogenous gene may also be eliminated by inserting non-functional sequences into the 
endogenous gene. The transgene may also be selectively introduced into a particular cell 
type, thus inactivating the endogenous gene of interest in only that cell type, by following, for 
example, the teaching of Gu et al (Gu et al, 1994, Science 265: 103-106). The regulatory 
sequences required for such a cell-type specific inactivation will depend upon the particular 
cell type of interest and will be apparent to those of skill in the art. 

Once transgenic animals have been generated, the expression of the recombinant gene 
of interest may be assayed utilizing standard techniques. Initial screening may be 
accomplished by Southern blot analysis or PCR techniques to analyze animal tissues to assay 
whether integration of the transgene has taken place. The level of mRNA expression of the 
transgene in the tissues of the transgenic animals may also be assessed using techniques 
which include, but are not limited to, Northern blot analysis of cell type samples obtained 
from the animal, in situ hybridization analysis, and RT-PCR. Samples of gene-expressing 
tissue, may also be evaluated immunocytochemically using antibodies specific for the 
transgene product, as described below. 
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5.3 CELLS THAT CONTAIN A DISRUPTED ALLELE OF A GENE ENCODING 
A POLYNUCLEOTIDE OF THE CURRENT INVENTION 



Another aspect of the current invention are cells which contain a gene that encodes a 
polynucleotide of the current invention and that has been disrupted. Those of skill in the art 
would know how to disrupt a gene in a cell using techniques known in the art. Also, 
techniques useful to disrupt a gene in a cell and especially an ES cell, that may already be 
disrupted, as disclosed in copending US patent applications Nos. 08/726,867; 08/728,963; 
08/907,598; and 08/942,806, all of which are hereby incorporated herein by reference in their 
entirety, are within the scope of the current invention to disrupt a gene that encodes a 
polynucleotide of the current invention. 

5.3.1 IDENTIFICATION OF CELLS THAT EXPRESS GENES ENCODING 
POLYNUCLEOTIDES OF THE CURRENT INVENTION 

Host cells that contain coding sequence and/or express a biologically active gene 
product, or fragment thereof, encoded by a gene corresponding to a GTS present invention 
may be identified by at least four general approaches; (a) DNA-DNA or DNA-RNA 
hybridization; (b) the presence or absence of "marker" gene functions; (c) assessing the level 
of transcription as measured by the expression of mRNA transcripts in the host cell; and (d) 
detection of the gene product as measured by immunoassay, enzymatic assay, chemical assay, 
or by its biological activity. Prior to screening for gene expression, the host cells can first be 
treated in an effort to increase the level of expression of genes encoding polynucleotides of 
the current invention, especially in cell lines that produce low amounts of the mRNAs and/or 
peptides and proteins of the current invention. 

In the first approach, the presence of the coding sequence for peptides and proteins of 
the current invention inserted in the expression vector can be detected by DNA-DNA or 
DNA-RNA hybridization using probes comprising nucleotide sequences that are homologous 
to the coding sequence for peptides and proteins of the current invention, respectively, or 
portions or derivatives thereof. 

In the second approach, the recombinant expression vector/host system can be 
identified and selected based upon the presence or absence of certain "marker" gene functions 
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(e.g., thymidine kinase activity, resistance to antibiotics, resistance to methotrexate, 
transformation phenotype, occlusion body formation in baculovirus, etc.). For example, if the 
coding sequence for the peptide or protein of the current invention is inserted within a marker 
gene sequence of the vector, recombinants containing the coding sequence for the peptide or 
protein of the current invention can be identified by the absence of the marker gene function. 
Alternatively, a marker gene can be placed in tandem with the sequence for the peptide or 
protein of the current invention under the control of the same or different promoter used to 
control the expression of the coding sequence for the peptide or protein of the current 
invention. Expression of the marker in response to induction or selection indicates 
expression of the coding sequence for the peptide or protein of the current invention. 

In the third approach, transcriptional activity for the coding region of genes specific 
for peptides and proteins of the current invention can be assessed by hybridization assays. 
For example, RNA can be isolated and analyzed by Northern blot using a probe derived from 
a GTS, or any portion thereof. Alternatively, total nucleic acids of the host cell may be 
extracted and assayed for hybridization to such probes. Additionally, RT-PCR (using GTS 
specific oligos/products) may be used to detect low levels of gene expression in a sample, or 
in RNA isolated from a spectrum of different tissues, or PCR can be used can be used to 
screen a variety of cDNA libraries derived from different tissues to determine which tissues 
express a given GTS. 

In the fourth approach, the expression of the peptides and proteins of the current 
invention can be assessed immunologically, for example by Western blots, immunoassays 
such as radioimmuno-precipitation, enzyme-linked immunoassays and the like. This can be 
achieved by using an antibody and a binding partner specific to a peptide or protein of the 
current invention. 

5.4 ANTIBODIES TO PROTEINS OF THE CURRENT INVENTION 

Antibodies that specifically recognize one or more epitopes of a peptide or protein of 
the current invention, or epitopes of conserved variants of a peptide or protein at least 
partially encoded by a GTS of the present invention, or any and all peptide fragments thereof, 
are also encompassed by the invention. Such antibodies include, but are not limited 



33 



CA1 -228897.1 



to,polyclonal antibodies, monoclonal antibodies (mAbs), humanized or chimeric antibodies, 
single chain antibodies, Fab fragments, F(ab') 2 fragments, fragments produced by a Fab 
expression library, anti-idiotypic (anti-Id) antibodies, and epitope-binding fragments of any 
of the above. 

5 The antibodies of the invention may be used, for example, in the detection of the 

peptide or protein of interest of the current invention in a biological sample and may, 
therefore, be utilized as part of a diagnostic or prognostic technique whereby patients may be 
tested for abnormal amounts of these proteins. Such antibodies may also be utilized in 
conjunction with, for example, compound screening schemes as described, below in Section 

10 5.6 for the evaluation of the effect of test compounds on expression and/or activity of the 
gene products of interest of the current invention. Additionally, such antibodies can be used 
in conjunction with the gene therapy and gene delivery techniques described below to, for 
example, evaluate the normal and/or engineered peptide- or protein-expressing cells prior to 
their introduction into the patient. Such antibodies may additionally be used as a method for 

1 5 inhibiting the abnormal activity of a peptide or protein of interest at least partially encoded by 
a GTS of the present invention. Thus, such antibodies may, for example, be utilized as part 
of treatment methods for development and cell differentiation disorders. 

For the production of antibodies, various host animals may be immunized by injection 
with the peptide or protein of interest, a subunit peptide of such protein, a truncated 

20 polypeptide, functional equivalents of the peptide or protein, mutants of the peptide or 

protein, or denatured forms of the above. Such host animals may include, but are not limited 
to,rabbits, mice, and rats, to name but a few. Various adjuvants can be used to increase the 
immunological response, depending on the host species, including but not limited to Freund's 
(complete and incomplete), mineral gels such as aluminum hydroxide, surface active 

25 substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, 

keyhole limpet hemocyanin, dinitrophenol, and potentially useful human adjutants such as 
BCG (bacille Calmette-Guerin) and Corynebacterium parvum. Polyclonal antibodies are 
heterogeneous populations of antibody molecules derived from the sera of the immunized 
animals. 
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Monoclonal antibodies, which are homogeneous populations of antibodies to a 
particular antigen, may be obtained by any technique which provides for the production of 
antibody molecules by continuous cell lines in culture. These include, but are not limited to, 
the hybridoma technique of Kohler and Milstein, (1975, Nature 255:495-497; and U.S. Patent 
No. 4,376,1 10), the human B-cell hybridoma technique (Kosbor et al, 1983, Immunology 
Today 4:12; Cole etal, 1983, Proc. Natl. Acad. Sci. USA 50:2026-2030), and the EBV- 
hybridoma technique (Cole et al, 1985, Monoclonal Antibodies And Cancer Therapy, Alan 
R. Liss, Inc., pp. 77-96). Such antibodies may be of any immunoglobulin class including 
IgG, IgM, IgE, IgA, IgD and any subclass thereof. The hybridoma producing the mAb of this 
invention may be cultivated in vitro or in vivo. Production of high titers of mAbs in vivo 
makes this the presently preferred method of production. 

In addition, techniques developed for the production of "chimeric antibodies" 
(Morrison etal, 1984, Proc. Natl. Acad. Sci. USA, Si:6851-6855;Neuberger etal, 1984, 
Nature, 372:604-608; Takeda et al, 1985, Nature, 314:452-454) by splicing the genes from a 
mouse antibody molecule of appropriate antigen specificity together with genes from a 
human antibody molecule of appropriate biological activity can be used. A chimeric 
antibody is a molecule in which different portions are derived from different animal species, 
such as those having a variable region derived from a porcine mAb and a human 
immunoglobulin constant region. 

Alternatively, techniques described for the production of single chain antibodies (U.S. 
Patent 4,946,778; Bird, 1988, Science 242:423-426; Huston et al, 1988, Proc. Natl. Acad. 
Sci. USA 55:5879-5883; and Ward et al, 1989, Nature 334:544-546) can be adapted to 
produce single chain antibodies against gene products of interest. Single chain antibodies are 
formed by linking the heavy and light chain fragments of the Fv region via an amino acid 
bridge, resulting in a single chain polypeptide. 

Antibody fragments which recognize specific epitopes may be generated by known 
techniques. For example, such fragments include, but are not limited to: the F(ab') 2 fragments 
which can be produced by pepsin digestion of the antibody molecule and the Fab fragments 
which can be generated by reducing the disulfide bridges of the F(ab') 2 fragments. 
Alternatively, Fab expression libraries may be constructed (Huse etal, 1989, Science, 
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246: 1275- 1281) to allow rapid and easy identification of monoclonal Fab fragments with the 
desired specificity. 

Antibodies to peptides and proteins that are fully or at least partially encoded by the 
described GTSs, or fragments or truncated versions thereof, can in turn be utilized to generate 
anti-idiotypic antibodies that "mimic" an epitope of the peptide or protein of interest, using 
techniques well known to those skilled in the art. (See, e.g., Greenspan & Bona, 1993, 
FASEB J 7f5):437-444; and Nissinoff, 1991, J. Immunol. 747(Sj:2429-2438). For example 
antibodies that bind to a regulatory peptide or protein of interest of the current invention and 
competitively inhibit the binding of such peptide or protein to any of its binding partners in 
the cell can be used to generate anti-idiotypes that "mimic" the peptide or protein of interest 
and, therefore, bind and neutralize the particular binding partner of the peptide or protein of 
interest. Such neutralizing antibodies, anti-idiotypes, Fab fragments of such antibodies, or 
humanized derivatives thereof, can be used in therapeutic regimens to mimic or neutralize 
(depending on the antibody) the effect of a particular peptide of interest, or a binding partner 
of a peptide or protein of interest. 

5.5 DIAGNOSIS OF DISORDERS AFFECTING DEVELOPMENT AND CELL 
DIFFERENTIATION 

A variety of methods can be employed for the diagnostic and prognostic evaluation of 
disorders involving developmental and differentiation processes, and for the identification of 
subjects having a predisposition to such disorders. 

Such methods may, for example, utilize reagents such as the nucleotide sequences 
described above, and antibodies to peptides and proteins of the current invention, as 
described, in Section 5.4. Specifically, such reagents may be used, for example, for: (1) the 
detection of the presence of gene mutations, or the detection of either over- or under- 
expression of the respective mRNAs relative to the non-disorder state; (2) the detection of 
either an over- or an under-abundance of the respective gene product relative to the non- 
disorder state; and (3) the detection of perturbations or abnormalities in the intra- and inter- 
cellular processes mediated by the respective peptides or proteins of the current invention. 
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The methods described herein may be performed, for example, by utilizing pre- 
packaged diagnostic kits comprising at least one specific nucleotide sequence of the current 
invention or antibody reagent described herein, which may be conveniently used, e.g., in 
clinical settings, to diagnose patients exhibiting developmental or cell differentiation disorder 
abnormalities. 

For the detection of mutations in any of the genes described above, any nucleated cell 
can be used as a starting source for genomic nucleic acid. For the detection of gene 
expression or gene products, any cell type or tissue in which the gene of interest is expressed, 
such as, for example, ES cells, may be utilized. Specific examples of cells and tissues that 
can be analyzed using the claimed polynucleotides include, but are not limited to, endothelial 
cells, epithelial cells, islets, neurons or neural tissue, mesothelial cells, osteocytes, 
lymphocytes, chondrocytes, hematopoietic cells, immune cells, cells of the major glands or 
organs (e.g., lung, heart, stomach, pancreas, kidney, skin, etc.), exocrine and/or endocrine 
cells, embryonic and other stem cells, fibroblasts, and culture adapted and/or transformed 
versions of the above. Diseases or natural processes that can also be correlated with the 
expression of mutant, or normal, variants of the disclosed GTSs include, but are not limited 
to, aging, cancer, autoimmune disease, lupus, scleroderma, Crohn's disease, multiple 
sclerosis, inflammatory bowel disease, immune disorders, schizophrenia, psychosis, alopecia, 
glandular disorders, inflammatory disorders, ataxia telangiectasia, diabetes, skin disorders 
such as acne, eczema, and the like, osteo and rheumatoid arthritis, high blood pressure, 
atherosclerosis, cardiovascular disease, pulmonary disease, degenerative diseases of the 
neural or skeletal systems, Alzheimer's disease, Parkinson's disease, osteoporosis, asthma, 
developmental disorders or abnormalities, genetic birth defects,- infertility, epithelial 
ulcerations, and viral, parasitic, fungal, yeast, or bacterial infection. 

Primary, secondary, or culture-adapted variants of cancer cells/tissues can also be 
analyzed using the claimed polynucleotides. Examples of such cancers include, but are not 
limited to, Cardiac: sarcoma (angiosarcoma, fibrosarcoma, rhabdomyosarcoma, liposarcoma), 
myxoma, rhabdomyoma, fibroma, lipoma and teratoma; Lung: bronchogenic carcinoma 
(squamous cell, undifferentiated small cell, undifferentiated large cell, adenocarcinoma), 
alveolar (bronchiolar) carcinoma, bronchial adenoma, sarcoma, lymphoma, chondromatous 
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hamartoma, mesothelioma; Gastrointestinal: esophagus (squamous cell carcinoma, 
adenocarcinoma, leiomyosarcoma, lymphoma), stomach (carcinoma, lymphoma, 
leiomyosarcoma), pancreas (ductal adenocarcinoma, insulinoma, glucagonoma, gastrinoma, 
carcinoid tumors, vipoma), small bowel (adenocarcinoma, lymphoma, carcinoid tumors, 
5 Karposi's sarcoma, leiomyoma, hemangioma, lipoma, neurofibroma, fibroma), large bowel 
(adenocarcinoma, tubular adenoma, villous adenoma, hamartoma, leiomyoma); Genitourinary 
tract: kidney (adenocarcinoma, Wilm's tumor [nephroblastoma], lymphoma, leukemia), 
bladder and urethra (squamous cell carcinoma, transitional cell carcinoma, adenocarcinoma), 
prostate (adenocarcinoma, sarcoma), testis (seminoma, teratoma, embryonal carcinoma, 
10 teratocarcinoma, choriocarcinoma, sarcoma, interstitial cell carcinoma, fibroma, 

fibroadenoma, adenomatoid tumors, lipoma); Liver: hepatoma (hepatocellular carcinoma), 
cholangiocarcinoma, hepatoblastoma, angiosarcoma, hepatocellular adenoma, hemangioma; 
Bone: osteogenic sarcoma (osteosarcoma), fibrosarcoma, malignant fibrous histiocytoma, 
chondrosarcoma, Ewing's sarcoma, malignant lymphoma (reticulum cell sarcoma), multiple 
1 5 myeloma, malignant giant cell tumor, chordoma, osteochronfroma (osteocartilaginous 

exostoses), benign chondroma, chondroblastoma, chondromyxofibroma, osteoid osteoma and 
giant cell tumors; Nervous system: skull (osteoma, hemangioma, granuloma, xanthoma, 
osteitis deformans), meninges (meningioma, meningiosarcoma, gliomatosis), brain 
(astrocytoma, medulloblastoma, glioma, ependymoma, germinoma [pinealoma], glioblastoma 
20 multiforme, oligodendroglioma, schwannoma, retinoblastoma, congenital tumors), spinal 
cord (neurofibroma, meningioma, glioma, sarcoma); Gynecological: uterus (endometrial 
carcinoma), cervix (cervical carcinoma, pre-tumor cervical dysplasia), ovaries (ovarian 
carcinoma [serous cystadenocarcinoma, mucinous cystadenocarcinoma, endometrioid tumors, 
celioblastoma, clear cell carcinoma, unclassified carcinoma], granulosa-thecal cell tumors, 
25 Sertoli-Leydig cell tumors, dysgerminoma, malignant teratoma), vulva (squamous cell 
carcinoma, intraepithelial carcinoma, adenocarcinoma, fibrosarcoma, melanoma), vagina 
(clear cell carcinoma, squamous cell carcinoma, botryoid sarcoma [embryonal 
rhabdomyosarcoma], fallopian tubes (carcinoma); Hematologic: blood (myeloid leukemia 
[acute and chronic], acute lymphoblastic leukemia, chronic lymphocytic leukemia, 
30 myeloproliferative diseases, multiple myeloma, myelodysplasia syndrome), Hodgkin's 
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disease, non-Hodgkin's lymphoma [malignant lymphoma]; Skin: malignant melanoma, basal 
cell carcinoma, squamous cell carcinoma, Karposi's sarcoma, moles, dysplastic nevi, lipoma, 
angioma, dermatofibroma, keloids, psoriasis; Breast: carcinoma and sarcoma, and Adrenal 
glands: neuroblastoma. 

5 Nucleic acid-based detection techniques and peptide detection techniques that can be 

used to conduct the above analyses are described below. 

5.5.1. DETECTION OF THE GENES OF THE CURRENT INVENTION AND 
THEIR RESPECTIVE TRANSCRIPTS 

10 

Mutations within the genes of the current invention can be detected by utilizing a 
number of techniques. Nucleic acid from any nucleated cell can be used as the starting point 
for such assay techniques, and may be isolated according to standard nucleic acid preparation 
procedures which are well known to those of skill in the art. 

1 5 DNA may be used in hybridization or amplification assays of biological samples to 

detect abnormalities involving gene structure, including point mutations, insertions, deletions 
and chromosomal rearrangements. Such assays may include, but are not limited to, Southern 
analyses, single stranded conformational polymorphism analyses (SSCP), and PCR analyses. 
Such diagnostic methods for the detection of gene-specific mutations can involve for 

20 example, contacting and incubating nucleic acids including recombinant DNA molecules, 
cloned genes or degenerate variants thereof, obtained from a sample, e.g., derived from a 
patient sample or other appropriate cellular source, with one or more labeled nucleic acid 
reagents including recombinant DNA molecules, cloned genes or degenerate variants thereof, 
as described above, under conditions favorable for the specific annealing of these reagents to 

25 their complementary sequences within the gene of interest of the current invention. 

Preferably, the lengths of these nucleic acid reagents are at least 15 to 30 nucleotides. After 
incubation, all non-annealed nucleic acids are removed from the nucleic acid molecule 
hybrid. The presence of nucleic acids which have hybridized, if any such molecules exist, is 
then detected. Using such a detection scheme, the nucleic acid from the cell type or tissue of 

30 interest can be immobilized, for example, to a solid support such as a membrane, or a plastic 
surface such as that on a microtiter plate or polystyrene beads. In this case, after incubation, 
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non-annealed, labeled nucleic acid reagents of the type described above are easily removed. 
Detection of the remaining, annealed, labeled nucleic acid reagents is accomplished using 
standard techniques well-known to those in the art. The gene sequences to which the nucleic 
acid reagents have annealed can be compared to the annealing pattern expected from a normal 

5 gene sequence in order to determine whether a gene mutation is present. 

Alternative diagnostic methods for the detection of gene specific nucleic acid 
molecules, in patient samples or other appropriate cell sources, may involve their 
amplification, e.g., by PCR (the experimental embodiment set forth in Mullis, K.B., 1987, 
U.S. Patent No. 4,683,202), followed by the detection of the amplified molecules using 

1 0 techniques well known to those of skill in the art. The resulting amplified sequences can be 
compared to those which would be expected if the nucleic acid being amplified contained 
only normal copies of the respective gene in order to determine whether a gene mutation 
exists. 

Additionally, well-known genotyping techniques can be performed to identify 
15 individuals carrying mutations in any of the genes of the current invention. Such techniques 
include, for example, the use of restriction fragment length polymorphisms (RFLPs), which 
involve sequence variations in one of the recognition sites for the specific restriction enzyme 
used. 

Furthermore, the polynucleotide sequences of the current invention may be mapped to 
20 chromosomes and specific regions of chromosomes using well known genetic and/or 

chromosomal mapping techniques. These techniques include in situ hybridization, linkage 
analysis against known chromosomal markers, hybridization screening with libraries or flow- 
sorted chromosomal preparations specific to known chromosomes, and the like. The 
technique of fluorescent in situ hybridization of chromosome spreads has been described, for 
25 example, in Verma et al. (1 988) Human Chromosomes: A Manual of Basic Techniques . 

Pergamon Press, New York. Fluorescent in situ hybridization of chromosomal preparations 
and other physical chromosome mapping techniques may be correlated with additional 
genetic map data. Examples of genetic map data can be found, for example, in Genetic 
Maps: Locus Maps of Complex Genomes. Book 5: Human Maps . O'Brien, editor, Cold 
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Spring Harbor Laboratory Press (1990). Comparisons of physical chromosomal map data 
may be of particular interest in detecting genetic diseases in carrier states. 

The level of expression of genes can also be assayed by detecting and measuring the 
transcription of such genes. For example, RNA from a cell type or tissue known, or 
suspected to express any of the genes of the current invention can be isolated and tested 
utilizing hybridization or PCR techniques (e.g., northern or RT PCR) such as those described, 
above. Such analyses may reveal both quantitative and qualitative aspects of the expression 
pattern of the respective gene, including activation or inactivation of gene expression. In situ 
hybridization using suitable radioactive labels, enzymatic labels, or chemically tagged forms 
of the described polynucleotide sequences can also be used to assess expression patterns in 
vivo. 

Additionally, an oligonucleotide or polynucleotide sequence first disclosed in at least 
a portion of one of the GTS sequences of SEQ ID NOS:9-503 can be used as a hybridization 
probe in conjunction with a solid support matrix/substrate (resins, beads, membranes, 
plastics, polymers, metal or metallized substrates, crystalline or polycrystalline substrates, 
etc.). Of particular note are spatially addressable arrays (i.e., gene chips, microtiter plates, 
etc.) of oligonucleotides and polynucleotides, or corresponding oligopeptides and 
polypeptides, wherein at least one of the biopolymers present on the spatially addressable 
array comprises an oligonucleotide or polynucleotide sequence first disclosed in at least one 
of the GTS sequences of SEQ ID NOS:9-503, or an amino acid sequence encoded thereby. 
Methods for attaching biopolymers to, or synthesizing biopolymers on, solid support 
matrices, and conducting binding studies thereon are disclosed in, inter alia, U.S. Patent Nos. 
5,556,752, 5,744,305, 4,631,211, 5,445,934, 5,252,743, 4,713,326, 5,424,186, and 4,689,405 
the disclosures of which are herein incorporated by reference in their entirety. 

Oligonucleotides corresponding to the described GTSs can be used as hybridization 
probes either singly or in chip format. For example, a series of such GTS oligonucleotide 
sequences, or the complements thereof, can he used to represent all or a portion of the 
described GTS sequences. The oligonucleotides, typically between about 16 to about 40 (or 
any whole number within the stated range) nucleotides in length, may partially overlap each 
other and/or the NHP sequence may be represented using oligonucleotides that do not 
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overlap. Accordingly, the described NHP polynucleotide sequences shall typically comprise 
at least about two or three distinct oligonucleotide sequences of at least about 18, and 
preferably about 25, nucleotides in length that are first disclosed in the described Sequence 
Listing. Such oligonucleotide sequences may begin at any nucleotide present within a 
sequence in the Sequence Listing and proceed in either a sense (5'-to-3') orientation vis-a-vis 
the described sequence or in an antisense orientation. 

Although the presently described GTSs have been specifically described using 
nucleotide sequence, it should be appreciated that each of the GTSs can uniquely be 
described using any of a wide variety of additional structural attributes, or combinations 
thereof. For example, a given GTS can be described by the net composition of the 
nucleotides present within a given region of the GTS in conjunction with the presence of one 
or more specific oligonucleotide sequence(s) first disclosed in the GTS. Alternatively, a 
restriction map specifying the relative positions of restriction endonuclease digestion sites, or 
various palindromic or other specific oligonucleotide sequences can be used to structurally 
describe a given GTS. Such restriction maps, which are typically generated by widely 
available computer programs (e.g., the University of Wisconsin GCG sequence analysis 
package, SEQUENCHER 3.0, Gene Codes Corp., Ann Arbor, MI, etc.), can optionally be 
used in conjunction with one or more discrete nucleotide sequence(s) present in the GTS that 
can be described by the relative position of the sequence relative to one or more additional 
sequence(s) or one or more restriction sites present in the GTS. 

5.5.2 DETECTION OF THE GENE PRODUCTS OF THE CURRENT 
INVENTION 

Antibodies directed against wild type or mutant gene products of the current invention 
or conserved variants or peptide fragments thereof, which are discussed above in Section 5.4 
may also be used as diagnostics and prognostics for disorders affecting development and 
cellular differentiation, as described herein. Such diagnostic methods, may be used to detect 
abnormalities in the level of gene expression, or abnormalities in the structure and/or 
temporal, tissue, cellular, or subcellular location of the respective gene product, and may be 
performed in vivo or in vitro, such as, for example, on biopsy tissue. 
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The tissue or cell type to be analyzed will generally include those which are known, or 
suspected, to contain cells that express the respective gene. The protein isolation methods 
employed herein may, for example, be such as those described in Harlow and Lane (Harlow, 
E. and Lane, D., 1988, "Antibodies: A Laboratory Manual", Cold Spring Harbor Laboratory 
5 Press, Cold Spring Harbor, New York), which is incorporated herein by reference in its 

entirety. The isolated cells can be derived from cell culture or from a patient. The analysis of 
cells taken from culture may be a necessary step in the assessment of cells that could be used 
as part of a cell-based gene therapy technique or, alternatively, to test the effect of compounds 
on the expression of the respective gene. 

10 For example, antibodies, or fragments of antibodies, such as those described above in 

Section 5.4 are also useful in the present invention to quantitatively or qualitatively detect the 
presence of gene products of the current invention or conserved variants or peptide fragments 
thereof. This can be accomplished, for example, by immunofluorescence techniques 
employing a fluorescently labeled antibody (see below, this Section) coupled with light 

15 microscopic, flow cytometric, or fluorimetric detection. 

The antibodies (or fragments thereof) or fusion or conjugated proteins useful in the 
present invention may, additionally, be employed histologically, as in immunofluorescence, 
immunoelectron microscopy or non-immuno assays, for in situ detection of gene products of 
the current invention or conserved variants or peptide fragments thereof, or for catalytic 

20 subunit binding (in the case of labeled catalytic subunit fusion protein). 

In situ detection may be accomplished by removing a histological specimen from a 
patient, and applying thereto a labeled antibody or fusion protein of the present invention. 
The antibody (or fragment) or fusion protein is preferably applied by overlaying the labeled 
antibody (or fragment) onto a biological sample. Through the use of such a procedure, it is 

25 possible to determine not only the presence of the gene product of the current invention, or 
conserved variants or peptide fragments, but also its distribution in the examined tissue. 
Using the present invention, those of ordinary skill will readily perceive that any of a wide 
variety of histological methods (such as staining procedures) can be modified in order to 
achieve such in situ detection. 
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Immunoassays and non-immunoassays for gene products of the current invention or 
conserved variants or peptide fragments thereof will typically comprise incubating a sample, 
such as a biological fluid, a tissue extract, freshly harvested cells, or lysates of cells which 
have been incubated in cell culture, in the presence of a detectably labeled antibody capable 
5 of identifying the respective gene products of interest or conserved variants or peptide 

fragments thereof, and detecting the bound antibody by any of a number of techniques well- 
known in the art. 

The biological sample may be brought in contact with and immobilized onto a solid 
phase support or carrier such as nitrocellulose, or other solid support which is capable of 

10 immobilizing cells, cell particles or soluble proteins. The support may then be washed with 
suitable buffers followed by treatment with the detectably labeled antibody specific to the 
peptide or protein of interest of the current invention or with fusion protein. The solid phase 
support may then be washed with the buffer a second time to remove unbound antibody or 
fusion protein. The amount of bound label on solid support may then be detected by 

15 conventional means. 

"Solid phase support or carrier" is intended to encompass any support capable of 
binding an antigen or an antibody. Well-known supports or carriers include glass, 
polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and modified 
celluloses, polyacrylamides, gabbros, and magnetite. The nature of the carrier can be either 

20 soluble to some extent or insoluble for the purposes of the present invention. The support 
material may have virtually any possible structural configuration so long as the coupled 
molecule is capable of binding to an antigen or antibody. Thus, the support configuration 
may be spherical, as in a bead, or cylindrical, as in the inside surface of a test tube, or the 
external surface of a rod. Alternatively, the surface may be flat such as a sheet, test strip, etc. 

25 Preferred supports include polystyrene beads. Those skilled in the art will know many other 
suitable carriers for binding antibody or antigen, or will be able to ascertain the same by use 
of routine experimentation. 

The binding activity of a given lot of antibody or fusion protein may be determined 
according to well known methods. Those skilled in the art will be able to determine operative 

30 and optimal assay conditions for each determination by employing routine experimentation. 
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With respect to antibodies, one of the ways in which the antibody can be detectably 
labeled is by linking the same to an enzyme and use in an enzyme immunoassay (EIA) 
(Voller, "The Enzyme Linked Immunosorbent Assay (ELISA)", 1978, Diagnostic Horizons 
2:1-7, Microbiological Associates Quarterly Publication, Walkersville, MD); Voller et al, 

5 1978, J. Clin. Pathol. 37:507-520; Butler, 1981, Meth. Enzymol. 73:482-523; Maggio (ed.), 
1980, Enzyme Immunoassay, CRC Press, Boca Raton, FL,; Ishikawa et al, (eds.), 1981, 
Enzyme Immunoassay, Kgaku Shoin, Tokyo). The enzyme which is bound to the antibody 
will react with an appropriate substrate, preferably a chromogenic substrate, in such a manner 
as to produce a chemical moiety which can be detected, for example, by spectrophotometric, 

10 fluorimetric or by visual means. Enzymes which can be used to detectably label the antibody 
include, but are not limited to, malate dehydrogenase, staphylococcal nuclease, delta-5- 
steroid isomerase, yeast alcohol dehydrogenase, alpha-glycerophosphate, dehydrogenase, 
triose phosphate isomerase, horseradish peroxidase, alkaline phosphatase, asparaginase, 
glucose oxidase, beta-galactosidase, ribonuclease, urease, catalase, glucose-6-phosphate 

1 5 dehydrogenase, glucoamylase and acetylcholinesterase. The detection can be accomplished 
by colorimetric methods which employ a chromogenic substrate for the enzyme. Detection 
may also be accomplished by visual comparison of the extent of enzymatic reaction of a 
substrate in comparison with similarly prepared standards. 

Detection may also be accomplished using any of a variety of other immunoassays. 

20 For example, by radioactively labeling the antibodies or antibody fragments, it is possible to 
detect the peptide or protein of interest through the use of a radioimmunoassay (RIA) (see, for 
example, Weintraub, B., Principles of Radioimmunoassays, Seventh Training Course on 
Radioligand Assay Techniques, The Endocrine Society, March, 1986, which is incorporated 
by reference herein). The radioactive isotope can be detected by such means as the use of a 

25 gamma counter or a scintillation counter or by autoradiography. 

It is also possible to label the antibody with a fluorescent compound. When the 
fluorescently labeled antibody is exposed to light of the proper wave length, its presence can 
then be detected due to fluorescence. Among the most commonly used fluorescent labeling 
compounds are fluorescein isothiocyanate, rhodamine, phycoerythrin, phycocyanin, 

30 allophycocyanin and fiuorescamine. 
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The antibody can also be detectably labeled using fluorescence emitting metals such 
as 152 Eu, or others of the lanthanide series. These metals can be attached to the antibody 
using such metal chelating groups as diethylenetriaminepentacetic acid (DTP A) or 
ethylenediaminetetraacetic acid (EDTA). 

5 The antibody also can be detectably labeled by coupling it to a chemiluminescent 

compound. The presence of the chemiluminescent-tagged antibody is then determined by 
detecting the presence of luminescence that arises during the course of a chemical reaction. 
Examples of particularly useful chemiluminescent labeling compounds are luminol, 
isoluminol, theromatic acridinium ester, imidazole, acridinium salt and oxalate ester. 

10 Likewise, a bio luminescent compound may be used to label the antibody of the 

present invention. Bioluminescence is a type of chemiluminescence found in biological 
systems in, which a catalytic protein increases the efficiency of the chemiluminescent 
reaction. The presence of a bioluminescent protein is determined by detecting the presence of 
luminescence. Important bioluminescent compounds for labeling purposes include, but are 

15 not limited to, luciferin, luciferase and aequorin. 

An additional use of a peptide or polypeptide encoded by an oligonucleotide or 
polynucleotide sequence first disclosed in at least one of the GTS sequences of SEQ ID 
NOS:9-503 is by incorporating the sequence into a phage display, or other peptide 
library/binding, system that can be used to screen for proteins, or other ligands, that are 

20 capable of binding to an amino acid sequence encoded by an oligonucleotide or 

polynucleotide sequence first disclosed in at least one of the GTS sequences of SEQ ID 
NOS:9-503 (see U.S. Patents Nos. 5,270,170, and 5,432,018, herein incorporated by 
reference in their entirety). Moreover, peptide arrays comprising a novel amino acid 
sequence corresponding to a portion of at least one of the polynucleotide sequences first 

25 disclosed in SEQ ID NOS:9-503 can be generated and screened essentially as described in 
U.S. Patents Nos. 5,143,854, 5,405,783, and 5,252,743, the complete disclosures of which are 
herein incorporated by references. 

Additionally, the presently described GTSs, or primers derived therefrom, can be used 
to screen spatially addressable arrays, or pools therefrom, of clones present in a full-length 

30 human cDNA library. The 96 well microtiter plate format is especially well-suited to the 
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screening, by PCR for example, of pooled subtractions of cDNA clones. 



5.6 SCREENING ASSAYS FOR COMPOUNDS THAT MODULATE THE 

EXPRESSION OR ACTIVITY OF PEPTIDES AND PROTEINS OF THE 
5 CURRENT INVENTION 

The following assays are designed to identify compounds that interact with (e.g., bind 
to) peptides and proteins at least partially encoded by one of SEQ ID NOS:9-503 (i.e. 
peptides or proteins of the current invention) compounds that interact with (e.g., bind to) 

1 0 intracellular proteins that interact with peptides and proteins of the current invention, 

compounds that interfere with the interaction of peptides and proteins of the current invention 
with each other and with other intracellular proteins involved in developmental and cell 
differentiation processes, and to compounds which modulate the activity of genes of the 
current invention (i.e., modulate the level of expression of genes of the current invention) or 

15 modulate the level of gene products of the current invention. Assays may additionally be 
utilized which identify compounds which bind to gene regulatory sequences (e.g., promoter 
sequences) and which may modulate the expression of genes of the current invention. See 
e.g., Piatt, K.A., 1994, J. Biol. Chem. 269:28558-28562, which is incorporated herein by 
reference in its entirety. 

20 Compounds that can be screened in accordance with the invention include, but are not 

limited to, peptides, antibodies and fragments thereof, prostaglandins, lipids and other organic 
compounds (e.g. , terpines, peptidomimetics) that bind to the peptide or protein of interest of 
the current invention and either mimic the activity triggered by the natural ligand (i.e., 
agonists) or inhibit the activity triggered by the natural ligand (i.e., antagonists); as well as 

25 peptides, antibodies or fragments thereof, and other organic compounds that mimic the 
peptide or protein of interest of the current invention (or a portion thereof) and bind to and 
"neutralize" natural ligand. 

Such compounds may include, but are not limited to, peptides such as, for example, 
soluble peptides, including but not limited to members of random peptide libraries (see, e.g. , 

30 Lam, K.S. et al, 1991, Nature 354:82-84; Houghten, R. et al, 1991, Nature 354:84-86), and 
combinatorial chemistry-derived molecular library peptides made of D- and/or L- 
configuration amino acids, phosphopeptides (including, but not limited to, members of 
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random or partially degenerate, directed phosphopeptide libraries; see, e.g. , Songyang, Z. et 
al, 1993, Cell 72:767-778); antibodies (including, but not limited to, polyclonal, monoclonal, 
humanized, anti-idiotypic, chimeric or single chain antibodies, and Fab, F(ab') 2 and Fab 
expression library fragments, and epitope-binding fragments thereof); and small organic or 
5 inorganic molecules. 

Other compounds that can be screened in accordance with the invention include, but 
are not limited to, small organic molecules that are able to gain entry into an appropriate cell 
{e.g., in ES cells) and affect the expression of a gene of the current invention or some other 
gene involved in development and cell differentiation (e.g., by interacting with the regulatory 

10 region or transcription factors involved in gene expression); or such compounds that affect 
the activity of the peptide or protein of interest of the current invention, e.g., by inhibiting or 
enhancing the binding of such peptide or protein to another cellular peptide or protein, or 
other factor, necessary for catalysis, signal transduction, or the like, that is involved in 
developmental or cell differentiation processes. 

15 Computer modeling and searching technologies permit the identification of 

compounds, or the improvement of already identified compounds, that can modulate the 
expression or activity of peptides or proteins of interest of the current invention. Having 
identified such a compound or composition, the active sites or regions are identified. Such 
active sites might typically be the binding partner sites, such as, for example, the interaction 

20 domains of the peptides and proteins of the current invention with their respective binding 
partners. The active site can be identified using methods known in the art including, for 
example, from study of the amino acid sequences of peptides, from the nucleotide sequences 
of nucleic acids, or from study of complexes of the relevant compound or composition with 
its natural ligand. In the latter case, chemical or X-ray crystallographic methods can be used 

25 to find the active site by finding where on the factor the complexed ligand is found. 

Next, the three dimensional geometric structure of the active site is determined. This 
can be done by known methods, including X-ray crystallography, which can determine a 
complete molecular structure. On the other hand, solid or liquid phase NMR can be used to 
determine certain intra-molecular distances. Any other experimental method of structure 

30 determination can be used to obtain partial or complete geometric structures. The geometric 
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structures may be measured with a complexed ligand, natural or artificial, which may 
increase the accuracy of the active site structure determined. 

If an incomplete or insufficiently accurate structure is determined, the methods of 
computer based numerical modeling can be used to complete the structure or improve its 
accuracy. Any recognized modeling method may be used, including parameterized models 
specific to particular biopolymers such as proteins or nucleic acids, molecular dynamics 
models based on computing molecular motions, statistical mechanics models based on 
thermal ensembles, or combined models. For most types of models, standard molecular force 
fields, representing the forces between constituent atoms and groups, are necessary, and can 
be selected from force fields known in physical chemistry. The incomplete or less accurate 
experimental structures can serve as constraints on the complete and more accurate structures 
computed by these modeling methods. 

Finally, having determined the structure of the active site, either experimentally, by 
modeling, or by a combination, candidate modulating compounds can be identified by 
searching databases containing compounds along with information on their molecular 
structure. Such a search seeks compounds having structures that match the determined active 
site structure and that interact with the groups defining the active site. Such a search can be 
manual, but is preferably computer assisted. These compounds found from this search are 
potential modulating compounds of the peptides and proteins of interest of the current 
invention. 

Alternatively, these methods can be used to identify improved modulating compounds 
from an already known modulating compound or ligand. The composition of the known 
compound can be modified and the structural effects of modification can be determined using 
the experimental and computer modeling methods described above applied to the new 
composition. The altered structure is then compared to the active site structure of the 
compound to determine if an improved fit or interaction results. In this manner, systematic 
variations in composition, such as by varying side groups, can be quickly evaluated to obtain 
modified modulating compounds or ligands of improved specificity or activity. 

Further experimental and computer modeling methods useful to identify modulating 
compounds based upon identification of the active sites of peptides and proteins of interest of 
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the current invention, and related factors involved in development, cellular differentiation, 
and other cellular processes will be apparent to those of skill in the art. 

Examples of molecular modeling systems are the CHARM and QUANTA programs 
(Polygon Corporation, Waltham, MA). CHARM performs the energy minimization and 

5 molecular dynamics functions. QUANTA performs the construction, graphic modeling and 
analysis of molecular structure. QUANTA allows interactive construction, modification, 
visualization, and analysis of the behavior of molecules with each other. 

A number of articles review computer modeling of drugs interactive with specific 
proteins, such as Rotivinen et al, 1988, Acta Pharmaceutical Fennica 97:159-166; Ripka, 

10 New Scientist 54-57 (June 16, 1988); McKinaly and Rossmann, 1989, Annu. Rev. 

Pharmacol. Toxicol. 29: 11 1-122; Perry and Davies, OSAR: Quantitative Structure- Activity 
Relationships in Drug Design pp. 189-193 (Alan R. Liss, Inc. 1989); Lewis and Dean, 1989, 
Proc. R. Soc. Lond. 236: 1 25-140 and 141-162; and, with respect to a model receptor for 
nucleic acid components, Askew et al, 1989, J. Am. Chem. Soc. 777:1082-1090. Other 

15 computer programs that screen and graphically depict chemicals are available from 

companies such as BioDesign, Inc. (Pasadena, CA.), Allelix, Inc. (Mississauga, Ontario, 
Canada), and Hypercube, Inc. (Cambridge, Ontario). Although these are primarily designed 
for application to drugs specific to particular proteins, they can be adapted to the design of 
drugs specific to regions of DNA or RNA, once that region is identified. 

20 Although described above with reference to design and generation of compounds 

which could alter binding, one could also screen libraries of known compounds, including 
natural products or synthetic chemicals, and biologically active materials, including proteins, 
for compounds which are inhibitors or activators. 

Compounds identified via assays such as those described herein may be useful, for 

25 example, in elaborating the biological function of the gene products of interest of the current 
invention and for ameliorating disorders affecting development and cell differentiation. 
Assays for testing the effectiveness of compounds, identified by, for example, techniques 
such as those described below. 

30 
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5.6.1. 



IN VITRO SCREENING ASSAYS FOR COMPOUNDS THAT BIND TO 
PEPTIDES AND PROTEINS OF THE CURRENT INVENTION 



In vitro systems may be designed to identify compounds capable of interacting with 
5 (e.g. , binding to) peptides and proteins of interest of the current invention, fragments thereof, 
and variants thereof. The identified compounds can be useful, for example, in modulating the 
activity of wild type and/or mutant gene products of the current invention; may be utilized in 
screens for identifying compounds that disrupt normal interactions of the peptides and 
proteins of the current invention with other factors, like, for example, other peptides and 

10 proteins; or may in themselves disrupt such interactions. 

The principle of the assays used to identify compounds that bind to the peptides and 
proteins of the current invention involves preparing a reaction mixture of the peptides and 
proteins of interest that are disclosed by the current invention and a test compound under 
conditions and for a time sufficient to allow the two components to interact and bind, thus 

1 5 forming a complex that can be removed from and/or detected in the reaction mixture. The 
peptides and proteins of the current invention used can vary depending upon the goal of the 
screening assay. For example, where agonists of the natural ligand are sought, the full length 
peptide or protein of interest, or a fusion protein containing the subunit of interest fused to a 
protein or polypeptide that affords advantages in the assay system (e.g., labeling, isolation of 

20 the resulting complex, etc.) can be utilized. 

The screening assays can be conducted in a variety of ways. For example, one 
method of conducting such an assay involves anchoring the peptide or protein of interest, or a 
fragment or fusion protein thereof, or the test substance onto a solid phase and detecting 
peptide or protein of interest/test compound complexes anchored on the solid phase at the end 

25 of the reaction. In one embodiment of such a method, the peptide or protein of interest may 
be anchored onto a solid surface, and the test compound, which is not anchored, may be 
labeled, either directly or indirectly. In another embodiment of the method, a peptide or 
protein of interest of the current invention anchored on the solid phase is complexed with a 
natural ligand of such peptide or protein of interest. Then, a test compound could be assayed 

30 for its ability to disrupt the association of the complex. 
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In practice, microtiter plates may conveniently be utilized as the solid phase. The 
anchored component may be immobilized by non-covalent or covalent attachments. Non- 
covalent attachment may be accomplished by simply coating the solid surface with a solution 
of the protein and drying. Alternatively, an immobilized antibody, preferably a monoclonal 

5 antibody, specific for the peptide or protein to be immobilized may be used to anchor the 
peptide or protein to the solid surface. The surfaces may be prepared in advance and stored. 

In order to conduct the assay, the nonimmobilized component is added to the coated 
surface containing the anchored component. After the reaction is complete, unreacted 
components are removed (e.g., by washing) under conditions such that any complexes formed 

1 0 will remain immobilized on the solid surface. The detection of complexes anchored on the 
solid surface can be accomplished in a number of ways. Where the previously 
nonimmobilized component is pre-labeled, the detection of label immobilized on the surface 
indicates that complexes were formed. Where the previously nonimmobilized component is 
not pre-labeled, an indirect label can be used to detect complexes anchored on the surface; 

15 e.g. , using a labeled antibody specific for the previously nonimmobilized component (the 
antibody, in turn, may be directly labeled or indirectly labeled with a labeled anti-Ig 
antibody). 

Alternatively, a reaction can be conducted in a liquid phase, the reaction products 
separated from unreacted components, and complexes detected; e.g., using an immobilized 
20 antibody specific for one component of complexes formed, like, for example, the peptide or 
protein of interest of the current invention or the test compound to anchor any complexes 
formed in solution, and a labeled antibody specific for the other component of the possible 
complex to detect anchored complexes. 

25 5.6.2 ASSAYS FOR INTRACELLULAR PROTEINS THAT INTERACT 
WITH THE PEPTIDES AND PROTEINS OF THE CURRENT 
INVENTION . 

Any method suitable for detecting protein-protein interactions may be employed for 
30 identifying intracellular peptides and proteins that interact with peptides and proteins of the 
current invention. Among the traditional methods which may be employed are 
co-immunoprecipitation, crosslinking and co-purification through gradients or 
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chromatographic columns of cell lysates or proteins obtained from cell lysates and the 
peptides and proteins of the current invention to identify proteins in the lysate that interact 
with those peptides and proteins of the current invention. For these assays, the peptides and 
proteins of the current invention may be used in full length, or in truncated or modified forms 

5 or as fusion-proteins. Similarly, the component may be a complex of two or more of the 
peptides and proteins of the current invention. Once isolated, such an intracellular protein 
can be identified and can, in turn, be used in conjunction with standard techniques to identify 
proteins with which it interacts. For example, at least a portion of the amino acid sequence of 
an intracellular protein which interacts with a peptide or protein of the current invention, can 

10 be ascertained using techniques well known to those of skill in the art, such as via the Edman 
degradation technique. (See, e.g., Creighton, 1983, "Proteins: Structures and Molecular 
Principles", W.H. Freeman & Co., N.Y., pp.34-49). The amino acid sequence obtained may 
be used as a guide for the generation of oligonucleotide mixtures that can be used to screen 
for gene sequences encoding such intracellular proteins. Screening may be accomplished, for 

1 5 example, by standard hybridization or PCR techniques. Techniques for the generation of 
oligonucleotide mixtures and the screening are well-known. (See, e.g., Ausubel, supra. , and 
PCR Protocols: A Guide to Methods and Applications, 1990, Innis, M. et al, eds. Academic 
Press, Inc., New York). 

Additionally, methods may be employed which result in the simultaneous 

20 identification of genes which encode the intracellular proteins interacting with peptides and 
proteins of the current invention. These methods include, for example, probing expression 
libraries, in a manner similar to the well known technique of antibody probing of egtl 1 
libraries, using a labeled form of a peptide or protein of the current invention, or a fusion 
protein, e.g., a peptide or protein at least partially encoded by a GTS of the present invention 

25 fused to a marker {e.g., an enzyme, fluor, luminescent protein, or dye), or an Ig-Fc domain. 
One method that detects protein interactions in vivo, the two-hybrid system, is 
described in detail for illustration only and not by way of limitation. One version of this 
system has been described (Chien etal, 1991, Proc. Natl. Acad. Sci. USA, 88:9578-9582) 
and is commercially available from Clontech (Palo Alto, CA). 
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Briefly, utilizing such a system, plasmids are constructed that encode two hybrid 
proteins: one plasmid consists of nucleotides encoding the DNA-binding domain of a 
transcription activator protein fused to a nucleotide sequence of the current invention 
encoding a peptide or protein of the current invention, a modified or truncated form or a 
5 fusion protein, and the other plasmid consists of nucleotides encoding the transcription 
activator protein's activation domain fused to a cDNA encoding an unknown protein which 
has been recombined into this plasmid as part of a cDNA library. The DNA-binding domain 
fusion plasmid and the cDNA library are transformed into a strain of the yeast 
Saccharomyces cerevisiae that contains a reporter gene {e.g., HBS or lacZ) whose regulatory 

10 region contains the transcription activator's binding site. Either hybrid protein alone cannot 
activate transcription of the reporter gene; the DNA-binding domain hybrid cannot because it 
does not provide activation function, and the activation domain hybrid cannot because it 
cannot localize to the activator's binding sites. Interaction of the two hybrid proteins 
reconstitutes the functional activator protein and results in expression of the reporter gene, 

1 5 which is detected by an assay for the reporter gene product. 

The two-hybrid system or related methodology may be used to screen activation 
domain libraries for proteins that interact with the "bait" gene product. By way of example, 
and not by way of limitation, a peptide or protein of the current invention may be used as the 
bait gene product. Total genomic or cDNA sequences are fused to the DNA encoding an 

20 activation domain. This library and a plasmid encoding a hybrid of a bait gene product of the 
current invention fused to the DNA-binding domain are cotransformed into a yeast reporter 
strain, and the resulting transformants are screened for those that express the reporter gene. 
For example, and not by way of limitation, a bait gene sequence of the current invention can 
be cloned into a vector such that it is translationally fused to the DNA encoding the DNA- 

25 binding domain of the GAL4 protein. These colonies are purified and the library plasmids 
responsible for reporter gene expression are isolated. DNA sequencing is then used to 
identify the proteins encoded by the library plasmids. 

A cDNA library of the cell line from which proteins that interact with bait gene 
product of the current invention are to be detected can be made using methods routinely 

30 practiced in the art. According to the particular system described herein, for example, the 
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cDNA fragments can be inserted into a vector such that they are translationaliy fused to the 
transcriptional activation domain of GAL4. This library can be co-transfected along with the 
bait gene-GAL4 fusion plasmid into a yeast strain which contains a lacZ gene driven by a 
promoter which contains GAL4 activation sequence. A cDNA encoded protein, fused to 
5 GAL4 transcriptional activation domain, that interacts with bait gene product will reconstitute 
an active GAL4 protein and thereby drive expression of the HIS3 gene. Colonies which 
express HIS3 can be detected by their growth on petri dishes containing semi-solid agar based 
media lacking histidine. The cDNA can then be purified from these strains, and used to 
produce and isolate the bait gene-interacting protein using techniques routinely practiced in 



The macromolecules that interact with the peptides and proteins of the current 
invention are referred to, for purposes of this discussion, as "binding partners". These 
binding partners are likely to be involved in catalytic reactions or signal transduction 

20 pathways, and therefore, in the role of the peptides and proteins of the current invention in 
development and cell differentiation. It is also desirable to identify compounds that interfere 
with or disrupt the interaction of such binding partners with the peptides and proteins of the 
current invention which may be useful in regulating the activity of the peptides and proteins 
of the current invention and thus control development and cell differentiation disorders 

25 associated with the activity of the peptides and proteins of the current invention. 

The basic principle of the assay systems used to identify compounds that interfere 
with the interaction between the peptides and proteins of the current invention and its binding 
partner or partners involves preparing a reaction mixture containing the peptides or proteins 
of the current invention of interest, modified or truncated version thereof, or fusion proteins 

30 thereof as described above, and the binding partner under conditions and for a time sufficient 
to allow the two to interact and bind, thus forming a complex. In order to test a compound 
for inhibitory activity, the reaction mixture is prepared in the presence and absence of the test 
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5.6.3 ASSAYS FOR COMPOUNDS THAT INTERFERE WITH 

INTERACTIONS OF THE PEPTIDES AND PROTEINS OF THE 
CURRENT INVENTION WITH INTRACELLULAR 
MACROMOLECULES 
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compound. The test compound may be initially included in the reaction mixture, or may be 
added at a time subsequent to the addition of the peptide or protein of the current invention 
and its binding partner. Control reaction mixtures are incubated without the test compound or 
with a placebo. The formation of any complexes between the peptide or protein of the 
5 current invention and the binding partner is then detected. The formation of a complex in the 
control reaction, but not in the reaction mixture containing the test compound, indicates that 
the compound interferes with the interaction of the peptide or protein at least partially 
encoded by a GTS of the present invention and the interactive binding partner. Additionally, 
complex formation within reaction mixtures containing the test compound and normal 
10 peptide or protein of the current invention may also be compared to complex formation 

within reaction mixtures containing the test compound and a mutant peptide or protein of the 
current invention. This comparison can be important in those cases wherein it is desirable to 
identify compounds that disrupt interactions of mutant but not normal forms of a peptide or 
protein of the current invention. 
1 5 The assay for compounds that interfere with the interaction of a peptide or protein of 

the current invention and binding partners can be conducted in a heterogeneous or 
homogeneous format. Heterogeneous assays involve anchoring either the peptide or protein 
of the current invention or the binding partner onto a solid phase and detecting complexes 
anchored on the solid phase at the end of the reaction. In homogeneous assays, the entire 
20 reaction is carried out in a liquid phase. In either approach, the order of addition of reactants 
can be varied to obtain different information about the compounds being tested. For 
example, test compounds that interfere with the interaction by competition can be identified 
by conducting the reaction in the presence of the test substance; i.e., by adding the test 
substance to the reaction mixture prior to or simultaneously with the peptide or protein of the 
25 current invention and interactive binding partner. Alternatively, test compounds that disrupt 
preformed complexes, e.g. compounds with higher binding constants that displace one of the 
components from the complex, can be tested by adding the test compound to the reaction 
mixture after complexes have been formed. The various formats are described briefly below. 
In a heterogeneous assay system, either the peptide or protein of the current invention 
30 or the interactive binding partner, is anchored onto a solid surface, while the non-anchored 
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species is labeled either directly or indirectly. In practice, microtiter plates are conveniently 
utilized. The anchored species may be immobilized by non-covalent or covalent attachments. 
Non-covalent attachment may be accomplished simply by coating the solid surface with a 
solution of the peptide or protein of the current invention or binding partner and drying. 
5 Alternatively, an immobilized antibody specific for the species to be anchored may be used to 
anchor the species to the solid surface. The surfaces may be prepared in advance and stored. 

In order to conduct the assay, the partner of the immobilized species is exposed to the 
coated surface with or without the test compound. After the reaction is complete, unreacted 
components are removed {e.g., by washing) and any complexes formed will remain 

10 immobilized on the solid surface. The detection of complexes anchored on the solid surface 
can be accomplished in a number of ways. Where the non-immobilized species is pre- 
labeled, the detection of label immobilized on the surface indicates that complexes were 
formed. Where the non-immobilized species is not pre-labeled, an indirect label can be used 
to detect complexes anchored on the surface; e.g., using a labeled antibody specific for the 

1 5 initially non-immobilized species (the antibody, in turn, may be directly labeled or indirectly 
labeled with a labeled anti-Ig antibody). Depending upon the order of addition of reaction 
components, test compounds which inhibit complex formation or which disrupt preformed 
complexes can be detected. 

Alternatively, the reaction can be conducted in a liquid phase in the presence or 

20 absence of the test compound, the reaction products separated from unreacted components, 
and complexes detected; e.g., using an immobilized antibody specific for one of the binding 
components to anchor any complexes formed in solution, and a labeled antibody specific for 
the other partner to detect anchored complexes. Again, depending upon the order of addition 
of reactants to the liquid phase, test compounds which inhibit complex or which disrupt 

25 preformed complexes can be identified. 

In an alternate embodiment of the invention, a homogeneous assay can be used. In 
this approach, a preformed complex of the peptide or protein of the current invention and the 
interactive binding partner is prepared in which either the peptide or protein of the current 
invention or its binding partner is labeled, but the signal generated by the label is quenched 

30 due to formation of the complex (see, e.g., U.S. Patent No. 4,109,496 by Rubenstein which 
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utilizes this approach for immunoassays). The addition of a test substance that competes with 
and displaces one of the species from the preformed complex will result in the generation of a 
signal above background. In this way, test substances which disrupt peptide or protein of the 
current invention/intracellular binding partner interaction can be identified. 
5 In a particular embodiment, a peptide or protein of the current invention can be 

prepared for immobilization. For example, the peptide or protein of the current invention or a 
fragment thereof can be fused to a glutathione-S-transferase (GST) gene using a fusion 
vector, such as pGEX-5X-l, in such a manner that its binding activity is maintained in the 
resulting fusion protein. The interactive binding partner can be purified and used to raise a 

10 monoclonal antibody, using methods routinely practiced in the art and described above. This 
antibody can be labeled with the radioactive isotope 125 I, for example, by methods routinely 
practiced in the art. In a heterogeneous assay, e.g., the GST-peptide or protein of the current 
invention fusion protein can be anchored to glutathione-agarose beads. The interactive 
binding partner can then be added in the presence or absence of the test compound in a 

1 5 manner that allows interaction and binding to occur. At the end of the reaction period, 

unbound material can be washed away, and the labeled monoclonal antibody can be added to 
the system and allowed to bind to the complexed components. The interaction between the 
peptide or protein of the current invention and the interactive binding partner can be detected 
by measuring the amount of radioactivity that remains associated with the glutathione- 

20 agarose beads. A successful inhibition of the interaction by the test compound will result in a 
decrease in measured radioactivity. 

Alternatively, the GST-peptide or protein of the current invention fusion protein and 
the interactive binding partner can be mixed together in liquid in the absence of the solid 
glutathione-agarose beads. The test compound can be added either during or after the species 

25 are allowed to interact. This mixture can then be added to the glutathione-agarose beads and 
unbound material is washed away. Again the extent of inhibition of the peptide or protein of 
the current invention/binding partner interaction can be detected by adding the labeled 
antibody and measuring the radioactivity associated with the beads. 

In another embodiment of the invention, these same techniques can be employed 

30 using peptide fragments that correspond to the binding domains of a peptide or protein of the 
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current invention and/or the interactive or binding partner (in cases where the binding partner 
is a protein) in place of one or both of the full length proteins. Any number of methods 
routinely practiced in the art can be used to identify and isolate the binding sites. These 
methods include, but are not limited to, mutagenesis of the gene encoding one of the proteins 
5 and screening for disruption of binding in a co-immunoprecipitation assay. Compensating 
mutations in the gene encoding the second species in the complex can then be selected. 
Sequence analysis of the genes encoding the respective proteins will reveal the mutations that 
correspond to the region of the protein involved in interactive binding. Alternatively, one 
protein can be anchored to a solid surface using methods described above, and allowed to 

10 interact with and bind to its labeled binding partner, which has been treated with a proteolytic 
enzyme, such as trypsin. After washing, a short, labeled peptide comprising the binding 
domain may remain associated with the solid material, which can be isolated and identified 
by amino acid sequencing. Also, once the gene coding for the intracellular binding partner is 
obtained, short gene segments can be engineered to express peptide fragments of the protein, 

15 which can then be tested for binding activity and purified or synthesized. 

For example, and not by way of limitation, a peptide or protein of the current 
invention can be anchored to a solid material as described, above, by making a GST-peptide 
or protein of the current invention fusion protein and allowing it to bind to glutathione 
agarose beads. The interactive binding partner can be labeled with a radioactive isotope, such 

20 as 35 S, and cleaved with a proteolytic enzyme such as trypsin. Cleavage products can then be 
added to the anchored GST-peptide or protein of the current invention fusion protein and 
allowed to bind. After washing away unbound peptides, labeled bound material, representing 
the intracellular binding partner binding domain, can be eluted, purified, and analyzed for 
amino acid sequence by well-known methods. Peptides so identified can be produced 

25 synthetically or fused to appropriate facilitative proteins using recombinant DNA technology. 



5.6.4 ASSAYS FOR IDENTIFICATION OF COMPOUNDS THAT 

AMELIORATE DISORDERS AFFECTING DEVELOPMENT AND 
CELL DIFFERENTIATION 

30 

Compounds including, but not limited to, binding compounds identified via assay 
techniques such as those described above, can be tested for the ability to ameliorate 
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development and cell differentiation disorder symptoms. The assays described above can 
identify compounds which affect the activity of peptides and proteins of the current invention 
(e.g., compounds that bind to the peptides and proteins of the current invention, inhibit 
binding of their natural ligands, and compounds that bind to a natural ligand of the peptides 
5 and proteins of the current invention and neutralize the ligand activity); or compounds that 
affect the activity of genes encoding peptides and proteins of the current invention (by 
affecting the expression of those genes, including molecules, e.g., proteins or small organic 
molecules, that affect or interfere with splicing events so that expression of the genes of 
interest can be modulated). However, it should be noted that the assays described herein can 

10 also identify compounds that modulate signal transduction or catalytic events that the 

peptides and proteins of the current invention are involved in. The identification and use of 
such compounds which affect a step in, for example, signal transduction pathways or catalytic 
events in which any of the peptides and proteins of the current invention are involved in, may 
modulate the effect of the peptides and proteins of the current invention on developmental or 

15 cell differentiation disorders. Such identification and use of such compounds are within the 
scope of the invention. Such compounds can be used as part of a therapeutic method for the 
treatment of developmental and cell differentiation disorders. 

The invention encompasses cell-based and animal model-based assays for the 
identification of compounds exhibiting such an ability to ameliorate developmental and cell 

20 differentiation disorder symptoms. Such cell-based assay systems can also be used as the 
standard to assay for purity and potency of the natural ligand, catalytic subunit, including 
recombinantly or synthetically produced catalytic subunit and catalytic subunit mutants. 

Cell-based systems can be used to identify compounds which may act to ameliorate 
developmental or cell differentiation disorder symptoms. Such cell systems can include, for 

25 example, recombinant or non-recombinant cells, such as cell lines, which express the gene 
encoding the peptide or protein of interest of the current invention. For example ES cells, or 
cell lines derived from ES cells can be used. In addition, expression host cells (e.g., COS 
cells, CHO cells, fibroblasts, Sf9 cells) genetically engineered to express a functional peptide 
or protein of the current invention in addition to factors necessary for the peptide or protein of 
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the current invention to fulfil its physiological role of, for example, signal transduction or 
catalyses, can be used as an end point in the assay. 

In utilizing such cell systems, cells may be exposed to a compound suspected of 
exhibiting an ability to ameliorate developmental or cell differentiation disorder symptoms, at 
5 a sufficient concentration and for a time sufficient to elicit such an amelioration of such 
disorder symptoms in the exposed cells. After exposure, the cells can be assayed to measure 
alterations in the expression of the gene encoding the peptide or protein of interest of the 
current invention, e.g., by assaying cell lysates for the appropriate mRNA transcripts (e.g., by 
Northern analysis) or for expression of the peptide or protein of interest of the current 

10 invention in the cell; compounds which regulate or modulate expression of the gene encoding 
the peptide or protein of interest of the current invention are valuable candidates as 
therapeutics. Alternatively, the cells are examined to determine whether one or more 
developmental or cell differentiation disorder-like cellular phenotypes has been altered to 
resemble a more normal or more wild type phenotype, or a phenotype more likely to produce 

15 a lower incidence or severity of disorder symptoms. Still further, the expression and/or 

activity of components of pathways or functionally or physiologically connected peptides or 
proteins of which the peptide or protein of interest of the current invention is a part, can be 
assayed. 

For example, after exposure of the cells, cell lysates can be assayed for the presence of 
20 increased levels of the test compound as compared to lysates derived from unexposed control 
cells. The ability of a test compound to inhibit production of the assay compound such 
systems indicates that the test compound inhibits signal transduction initiated by the peptide 
or protein of interest of the current invention. Finally, a change in cellular morphology of 
intact cells may be assayed using techniques well known to those of skill in the art. 
25 In addition, animal-based development or cell differentiation disorder systems, which 

may include, for example, mice, may be used to identify compounds capable of ameliorating 
development or cell differentiation disorder-like symptoms. Such animal models may be 
used as test systems for the identification of drugs, pharmaceuticals, therapies and 
interventions which may be effective in treating such disorders. For example, animal models 
30 may be exposed to a compound, suspected of exhibiting an ability to ameliorate development 
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or cell differentiation disorder symptoms, at a sufficient concentration and for a time 
sufficient to elicit such an amelioration of development and/or cell differentiation disorder 
symptoms in the exposed animals. The response of the animals to the exposure may be 
monitored by assessing the reversal of disorders associated with development and/or cell 
5 differentiation disorders. With regard to intervention, any treatments which reverse any 

aspect of development or cell differentiation disorder-like symptoms should be considered as 
candidates for human development and/or cell differentiation disorder therapeutic 
intervention. Dosages of test agents may be determined by deriving dose-response curves, as 
discussed below. 

10 

5.7 THE TREATMENT OF DISORDERS ASSOCIATED WITH STIMULATION 
OF PEPTIDES AND PROTEINS OF THE CURRENT INVENTION 

The invention also encompasses methods and compositions for modifying 
1 5 development and cell differentiation and treating development and cell differentiation 

disorders. For example, one may decrease the level of expression of one or more genes of the 
current invention, and/or downregulate activity of one or more of the peptides or proteins of 
interest of the current invention. Thereby, the response of cells, like, for example, ES cells, to 
factors which activate the physiological responses that enhance the pathological processes 
20 leading to developmental and cell differentiation disorders may be reduced and the symptoms 
ameliorated. Conversely, the response of cells, like, for example, ES cells, to physiological 
stimuli involving any of the peptides or proteins of the current invention and necessary for 
proper developmental and cell differentiation processes may be augmented by increasing the 
activity of one or several of the peptides or proteins of interest of the current invention. 
25 Different approaches are discussed below. 

5.7. 1 INHIBITION OF PEPTIDES AND PROTEINS OF THE CURRENT 
INVENTION TO REDUCE DEVELOPMENT AND CELL 
DIFFERENTIATION DISORDERS 

30 

Any method which neutralizes the catalytic or signal transduction activity of the 
peptides and proteins of the current invention or which inhibits expression of the genes 
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encoding peptides and proteins (either transcription or translation) can be used to reduce 
symptoms associated with developmental and cell differentiation disorders. 

In one embodiment, immuno therapy can be designed to reduce the level of 
endogenous gene expression for the peptides and proteins of the current invention, e.g., using 
5 antisense or ribozyme approaches to inhibit or prevent translation of mRNA transcripts; triple 
helix approaches to inhibit transcription of the genes; or targeted homologous recombination 
to inactivate or "knock out" the genes or its endogenous promoter. 

Antisense approaches involve the design of oligonucleotides (either DNA or RNA) 
that are complementary to mRNA specific for peptides and proteins of interest of the current 

10 invention. The antisense oligonucleotides will bind to the complementary mRNA transcripts 
and prevent translation. Absolute complementarity, although preferred, is not required. A 
sequence "complementary" to a portion of an RNA, as referred to herein, means a sequence 
having sufficient complementarity to be able to hybridize with the RNA, forming a stable 
duplex. In the case of double-stranded antisense nucleic acids, a single strand of the duplex 

15 DNA may thus be tested, or triplex formation may be assayed. The ability to hybridize will 
depend on both the degree of complementarity and the length of the antisense nucleic acid. 
Generally, the longer the hybridizing nucleic acid, the more base mismatches with an RNA it 
may contain and still form a stable duplex (or triplex, as the case may be). One skilled in the 
art can ascertain a tolerable degree of mismatch by use of standard procedures to determine 

20 the melting point of the hybridized complex. 

Oligonucleotides that are complementary to the 5' end of the message, e.g., the 5' 
untranslated sequence up to and including the AUG initiation codon, should work most 
efficiently at inhibiting translation. However, sequences complementary to the 3' 
untranslated sequences of mRNAs have recently shown to be effective at inhibiting 

25 translation of mRNAs as well. See generally, Wagner, R., 1994, Nature 372:333-335. Thus, 
oligonucleotides complementary to either the 5'- or 3'- non- translated, non-coding regions of 
the mRNAs specific for the peptides and proteins of the current invention could be used in an 
antisense approach to inhibit translation of those endogenous mRNAs. Oligonucleotides 
complementary to the 5' untranslated region of the mRNA should include the complement of 

30 the AUG start codon. Antisense oligonucleotides complementary to mRNA coding regions 
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are less efficient inhibitors of translation but could be used in accordance with the invention. 
Whether designed to hybridize to the 5'-, 3'- or coding region of an mRNA, antisense nucleic 
acids should be at least six nucleotides in length, and are preferably oligonucleotides ranging 
from 6 to about 50 nucleotides in length. In specific aspects the oligonucleotide is at least 10 
5 nucleotides, at least 17 nucleotides, at least 25 nucleotides or at least 50 nucleotides. 

Regardless of the choice of target sequence, it is preferred that in vitro studies are first 
performed to quantitate the ability of the antisense oligonucleotide to inhibit gene expression. 
It is preferred that these studies utilize controls that distinguish between antisense gene 
inhibition and nonspecific biological effects of oligonucleotides. It is also preferred that these 

10 studies compare levels of the target RNA or protein with that of an internal control RNA or 
protein. Additionally, it is envisioned that results obtained using the antisense 
oligonucleotide are compared with those obtained using a control oligonucleotide. It is 
preferred that the control oligonucleotide is of approximately the same length as the test 
oligonucleotide and that the nucleotide sequence of the oligonucleotide differs from the 

15 antisense sequence no more than is necessary to prevent specific hybridization to the target 
sequence. 

The oligonucleotides can be DNA or RNA or chimeric mixtures or derivatives or 
modified versions thereof, single-stranded or double-stranded. The oligonucleotide can be 
modified at the base moiety, sugar moiety, or phosphate backbone, for example, to improve 

20 stability of the molecule, hybridization, etc. The oligonucleotide may include other appended 
groups such as peptides (e.g. , for targeting host cell receptors in vivo), or agents facilitating 
transport across the cell membrane (see, e.g., Letsinger et al, 1989, Proc. Natl. Acad. Sci. 
U.S.A. 55:6553-6556; Lemaitre et al, 1987, Proc. Natl. Acad. Sci. 54:648-652; PCT 
Publication No. WO88/09810, published December 15, 1988), or hybridization-triggered 

25 cleavage agents. (See, e.g., Krol et al, 1988, BioTechniques 5:958-976) or intercalating 
agents. (See, e.g., Zon, 1988, Pharm. Res. 5:539-549). To this end, the oligonucleotide may 
be conjugated to another molecule, e.g., a peptide, hybridization triggered cross-linking 
agent, transport agent, hybridization-triggered cleavage agent, etc. 

The antisense oligonucleotide may comprise at least one modified base moiety which 

30 is selected from the group including, but not limited to, 5-fluorouracil, 5-bromouracil, 
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5-chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4-acetylcytosine, 
5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 
5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 
5 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil 5 beta- 
D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6- 
isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 

2- thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil- 

10 5 -oxy acetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino- 

3- N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. 

The antisense oligonucleotide may also comprise at least one modified sugar moiety 
selected from the group including, but not limited to, arabinose, 2-fluoroarabinose, xylulose, 
and hexose. 

15 In another embodiment, the antisense oligonucleotide comprises at least one modified 

phosphate backbone selected from the group consisting of a phosphorothioate, a 
phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a phosphordiamidate, a 
methylphosphonate, an alkyl phosphotriester, and a formacetal or analog thereof. 

In yet another embodiment, the antisense oligonucleotide is an alpha-anomeric 

20 oligonucleotide. An alpha-anomeric oligonucleotide forms specific double-stranded hybrids 
with complementary RNA in which, contrary to the usual alpha-units, the strands run parallel 
to each other (Gautier et al, 1987, Nucl. Acids Res. 75:6625-6641). The oligonucleotide is a 
2'-0-methylribonucleotide (Inoue et al, 1987, Nucl. Acids Res. 75:6131-6148), or a chimeric 
RNA-DNA analogue (Inoue etal, 1987, FEBS Lett. 275:327-330). 

25 Oligonucleotides of the invention may be synthesized by standard methods known in 

the art, e.g. by use of an automated DNA synthesizer (such as are commercially available 
from Biosearch, Applied Biosystems, etc.). As examples, phosphorothioate oligonucleotides 
may be synthesized by the method of Stein et al., 1988, Nucl. Acids Res. 76:3209. 
Methylphosphonate oligonucleotides can be prepared by use of controlled pore glass polymer 

30 supports (Sarin et al. , 1 988, Proc. Natl. Acad. Sci. U.S.A. 55:7448-745 1). 
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While antisense nucleotides complementary to the coding region sequence specific for 
the peptides and proteins of the current invention could be used, those complementary to the 
transcribed untranslated region are most preferred. 

The antisense molecules should be delivered to cells which express the peptides and 
5 proteins of interest of the current invention in vivo, like, for example, ES cells. A number of 
methods have been developed for delivering antisense DNA or RNA to cells; e.g., antisense 
molecules can be injected directly into the tissue or cell derivation site, or modified antisense 
molecules, designed to target the desired cells (e.g., antisense linked to peptides or antibodies 
that specifically bind receptors or antigens expressed on the target cell surface) can be 

10 administered systemically. 

However, it is often difficult to achieve intracellular concentrations of antisense 
molecules that are sufficient to suppress translation of endogenous mRNAs. Therefore a 
preferred approach utilizes a recombinant DNA construct in which the antisense 
oligonucleotide is placed under the control of a strong pol III or pol II promoter. The use of 

1 5 such a construct to transfect target cells in the patient will result in the transcription of 

sufficient amounts of single stranded RNAs that will form complementary base pairs with the 
endogenous transcripts specific for the peptides and proteins of interest of the current 
invention and thereby prevent translation of the respective mRNAs. For example, a vector 
can be introduced in vivo such that it is taken up by a cell and directs the transcription of an 

20 antisense RNA. Such a vector can remain episomal or become chromosomally integrated, as 
long as it can be transcribed to produce the desired antisense RNA. Such vectors can be 
constructed by recombinant DNA technology methods standard in the art. Vectors can be 
plasmid, viral, or others known in the art, used for replication and expression in mammalian 
cells. Expression of the sequence encoding the antisense RNA can be by any promoter 

25 known in the art to act in mammalian, preferably human cells. Such promoters can be 
inducible or constitutive. Such promoters include, but are not limited to: the SV40 early 
promoter region (Bernoist and Chambon, 1981, Nature 290:304-310), the promoter contained 
in the 3' long terminal repeat of Rous sarcoma virus (Yamamoto et al, 1980, Cell 22:787- 
797), the herpes thymidine kinase promoter (Wagner et al, 1981, Proc. Natl. Acad. Sci. 

30 U.S.A. 78:1441-1445), the regulatory sequences of the metallothionein gene (Brinster et al, 
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1982, Nature 296:39-42), etc. Any type of plasmid, cosmid, YAC or viral vector can be used 
to prepare the recombinant DNA construct which can be introduced directly into the tissue or 
cell derivation site; e.g. , the bone marrow. Alternatively, viral vectors can be used which 
selectively infect the desired tissue or cell type; (e.g., viruses which infect cells of 
5 hematopoietic lineage), in which case administration may be accomplished by another route 
(e.g., systemically). 

Ribozyme molecules designed to catalytically cleave mRNA transcripts specific for 
the peptides and proteins of interest of the current invention can also be used to prevent 
translation of the mRNAs of interest and expression of the peptides and proteins encoded by 

10 those mRNAs. (See, e.g., PCT International Publication WO90/1 1364, published October 4, 
1990; Sarver et al, 1990, Science 247:1222-1225). While ribozymes that cleave mRNA at 
site specific recognition sequences can be used to destroy mRNAs, the use of hammerhead 
ribozymes is preferred. Hammerhead ribozymes cleave mRNAs at locations dictated by 
flanking regions that form complementary base pairs with the target mRNA. The sole 

15 requirement is that the target mRNA have the following sequence of two bases: 5'-UG-3'. 
The construction and production of hammerhead ribozymes is well known in the art and is 
described more fully in Haseloff and Gerlach, 1988, Nature, 334:585-591. Preferably the 
ribozyme is engineered so that the cleavage recognition site is located near the 5' end of the 
mRNA of interest; i.e., to increase efficiency and minimize the intracellular accumulation of 

20 non-functional mRNA transcripts. 

The ribozymes of the present invention also include RNA endoribonucleases 
(hereinafter "Cech-type ribozymes") such as the one which occurs naturally in Tetrahymena 
Thermophila (known as the IVS, or L-19 IVS RNA) and which has been extensively 
described by Thomas Cech and collaborators (Zaug et al, 1984, Science, 224:574-578; Zaug 

25 and Cech, 1 986, Science, 231 :470-475; Zaug et al , 1986, Nature, 324:429-43 3 ; published 
International Patent Application No. WO 88/04300 by University Patents Inc.; Been and 
Cech, 1986, Cell, 47:207-216). The Cech-type ribozymes have an eight base pair active site 
which hybridizes to a target RNA sequence where after cleavage of the target RNA takes 
place. The invention encompasses those Cech-type ribozymes which target eight base-pair 
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active site sequences that are present in the mRNAs specific for the peptides and proteins of 
interest of the current invention. 

As in the antisense approach, the ribozymes can be composed of modified 
oligonucleotides (e.g. for improved stability, targeting, etc.) and should be delivered to cells 
5 which express the peptides and proteins of interest of the current invention in vivo, like, for 
example, ES cells. A preferred method of delivery involves using a DNA construct 
"encoding" the ribozyme under the control of a strong constitutive pol III or pol II promoter, 
so that transfected cells will produce sufficient quantities of the ribozyme to destroy the 
endogenous messages specific for the peptides and proteins of interest of the current 

10 invention and inhibit translation. Because ribozymes unlike antisense molecules, are 
catalytic, a lower intracellular concentration is required for efficiency. 

Endogenous gene expression can also be reduced by inactivating or "knocking out" 
the gene of interest specific for a peptide or protein of the current invention or its promoter 
using targeted homologous recombination, (e.g., see Smithies et al., 1985, Nature 377:230- 

15 234; Thomas & Capecchi, 1987, Cell 57:503-512; Thompson et al, 1989 Cell 5:313-321; 
each of which is incorporated by reference herein in its entirety). For example, a mutant, 
non-functional peptide or protein of interest of the current invention (or a completely 
unrelated DNA sequence) flanked by DNA homologous to the endogenous gene encoding 
said peptide or protein of interest of the current invention (either the coding regions or 

20 regulatory regions of the gene) can be used, with or without a selectable marker and/or a 

negative selectable marker, to transfect cells that express said peptide or protein of interest of 
the current invention in vivo. Insertion of the DNA construct, via targeted homologous 
recombination, results in inactivation of the targeted endogenous gene. Such approaches are 
particularly suited in the agricultural field where modifications to ES cells can be used to 

25 generate animal offspring with an inactive copy of a gene encoding a peptide or protein of 
interest of the current invention (e.g., see Thomas & Capecchi 1987 and Thompson 1989, 
supra) . However this approach can be adapted for use in humans provided the recombinant 
DNA constructs are directly administered or targeted to the required site in vivo using 
appropriate viral vectors. 
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Alternatively, endogenous expression of a gene of interest can be reduced by targeting 
deoxyribonucleoti.de sequences complementary to the regulatory region of said gene (i.e., the 
promoter and/or enhancers) to form triple helical structures that prevent transcription of the 
gene of interest in target cells in the body. (See generally, Helene, C. 1991, Anticancer Drug 
5 Des., 6767:569-84; Helene, C. et al, 1992, Ann, N.Y. Acad. Sci., 660:27-36; and Maher, L.J., 
1992, Bioassays 14(12):%Ql-\5). 

In yet another embodiment of the invention, the activity of a peptide or protein of 
interest of the current invention can be reduced using a "dominant negative" approach. A 
dominant negative approach takes advantage of the interaction of the peptides or proteins of 

10 interest with other peptides or proteins to form complexes, the formation of which is a 
prerequisite for the peptide or protein of interest of the current invention to exert its 
physiological activity. To this end, constructs which encode a defective form of the peptide 
or protein of interest of the current invention can be used in gene therapy approaches to 
diminish the activity of said peptide or protein of interest in appropriate target cells. 

15 Alternatively, targeted homologous recombination can be utilized to introduce such deletions 
or mutations into the subject's endogenous gene encoding the peptide or protein of interest of 
the current invention in the appropriate tissue. The engineered cells will express non- 
functional copies of the peptide or protein of interest of the current invention, thereby 
downregulating its activity in vivo. Such engineered cells should demonstrate a diminished 

20 response to physiological stimuli of the activity of the affected peptide or protein of interest 
of the current invention, resulting in reduction of the development or cell differentiation 
disorder phenotype. 



5.7.2 RESTORATION OR INCREASE IN EXPRESSION OR ACTIVITY OF 
25 A PEPTIDE OR PROTEIN OF THE CURRENT INVENTION TO 

PROMOTE DEVELOPMENT OR CELL DIFFERENTIATION 



With respect to an increase in the level of normal gene expression and/or gene product 
30 activity specific for any of the peptides and proteins of interest of the current invention, the 
respective nucleic acid sequences can be utilized for the treatment of development and cell 
differentiation disorders. Where the cause of the development or cell differentiation 
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dysfunction is a defective peptide or protein of the current invention, treatment can be 
administered, for example, in the form of gene delivery or gene therapy. Specifically, one or 
more copies of a normal gene or a portion of the gene that directs the production of a gene 
product exhibiting normal function of the appropriate peptide or protein of the current 
5 invention, may be inserted into the appropriate cells within a patient or animal subject, 
optionally using suitable vectors. Recombinant retroviruses have been widely used in gene 
transfer or gene delivery experiments and even human clinical trials (see generally, Mulligan, 
R.C., Chapter 8, In: Experimental Manipulation of Gene Expression, Academic Press, pp. 
155-173 (1983); Coffin, J., In: RNA Tumor Viruses. Weiss, R. et al. (eds.), Cold Spring 

10 Harbor Laboratory, Vol. 2, pp. 36-38 (1985). Other eucaryotic viruses which have been used 
as vectors to transduce mammalian cells include adenovirus, papilloma virus, herpes virus, 
adeno-associated virus, vaccinia virus, rabies virus, and the like (See generally, Sambrook et 
al, Molecular Cloning . Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York, Vol. 3:16.1-16.89 (1989). Alternatively, cationic or other lipids may be employed to 

15 deliver polynucleotides comprising (or including) the described GTS sequences to patients. 
Additionally, naked DNA comprising one or more GTS sequences, optionally modified by 
the addition of one or more of, in operable combination and orientation, a promoter, an 
enhancer, a ribosome entry or ribosome binding site, and/or an in-frame translation initiation 
codon can be employed to deliver GTSs to a patient. Another use of the above constructs 

20 includes "naked" DNA vaccines that can be introduced in vivo alone, or in conjunction with 
excipients, or microcarrier spheres, nanoparticles or other supporting or dosaging compounds 
or molecules. 

The gene replacement/delivery therapies described above should be capable of 
delivering gene sequences to the cell types within patients which express the peptide or 
25 protein of interest of the current invention. Alternatively, targeted homologous 

recombination can be utilized to correct the defective endogenous gene in the appropriate cell 
type. In animals, targeted homologous recombination can be used to correct the defect in ES 
cells in order to generate offspring with a corrected trait. 

Finally, compounds identified in the assays described above that stimulate, enhance, 
30 or modify the activity of the peptides and proteins of the current invention can be used to 
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achieve proper development and cell differentiation. The formulation and mode of 
administration will depend upon the physico-chemical properties of the compound. 

5-8 PHARMACEUTICAL PREPARATIONS AND METHODS OF 
5 ADMINISTRATION 

Compounds that are determined to affect gene expression of the peptides and proteins 
of the current invention, comprise nucleotide sequence information that is at least partially 
first disclosed in the Sequence Listing (i.e., sequences used in antisense, gene therapy, 

10 dsRNA, or ribozyme applications), or the interaction of such peptides and proteins with any 
of their binding partners, can be administered to a patient at therapeutically effective doses to 
treat or ameliorate development and cell differentiation disorders. A therapeutically effective 
dose refers to that amount of the compound sufficient to result in any amelioration or 
retardation of disease symptoms, or development and cell differentiation or proliferation 

15 disorders. 

5.8.1 EFFECTIVE DOSE 

Toxicity and therapeutic efficacy of such compounds can be determined by standard 
pharmaceutical procedures in cell cultures or experimental animals, e.g., for determining the 

20 LD 50 (the dose lethal to 50% of the population) and the ED 50 (the dose therapeutically 

effective in 50% of the population). The dose ratio between toxic and therapeutic effects is 
the therapeutic index and it can be expressed as the ratio LD 50 /ED 50 . Compounds which 
exhibit large therapeutic indices are preferred. While compounds that exhibit toxic side 
effects may be used, care should be taken to design a delivery system that targets such 

25 compounds to the site of affected tissue in order to minimize potential damage to uninfected 
cells and, thereby, reduce side effects. 

The data obtained from the cell culture assays and animal studies can be used in 
formulating a range of dosage for use in humans. The dosage of such compounds lies 
preferably within a range of circulating concentrations that include the ED 50 with little or no 

30 toxicity. The dosage may vary within this range depending upon the dosage form employed 
and the route of administration utilized. For any compound used in the method of the 
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invention, the therapeutically effective dose can be estimated initially from cell culture 
assays. A dose may be formulated in animal models to achieve a circulating plasma 
concentration range that includes the IC 50 (i.e., the concentration of the test compound which 
achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such 

5 information can be used to more accurately determine useful doses in humans. Levels in 
plasma may be measured, for example, by high performance liquid chromatography. 

When the therapeutic treatment of disease is contemplated, the appropriate dosage 
may also be determined using animal studies to determine the maximal tolerable dose, or 
MTD, of a bioactive agent per kilogram weight of the test subject. In general, at least one 

10 animal species tested is mammalian. Those skilled in the art regularly extrapolate doses for 
efficacy and avoiding toxicity to other species, including human. Before human studies of 
efficacy are undertaken, Phase I clinical studies in normal subjects help establish safe doses. 

Additionally, the bioactive agent may be complexed with a variety of well established 
compounds or structures that, for instance, enhance the stability of the bioactive agent, or 

15 otherwise enhance its pharmacological properties (e.g., increase in vivo half- life, reduce 
toxicity, etc.). 

The above therapeutic agents will be administered by any number of methods known 
to those of ordinary skill in the art including, but not limited to, administration by inhalation; 
by subcutaneous (sub-q), intravenous (I.V.), intraperitoneal (LP.), intramuscular (I.M.), or 
20 intrathecal injection; or as a topically applied agent (transderm, ointments, creams, salves, eye 
drops, and the like). 

5.8.2 FORMULATIONS AND USE 

Pharmaceutical compositions for use in accordance with the present invention may be 
25 formulated in conventional manner using one or more physiologically acceptable carriers or 
excipients. 

Thus, the compounds and their physiologically acceptable salts and solvates may be 
formulated for administration by inhalation or insufflation (either through the mouth or the 
nose) or oral, buccal, parenteral or rectal administration. 
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For oral administration, the pharmaceutical compositions may take the form of, for 
example, tablets or capsules prepared by conventional means with pharmaceutically 
acceptable excipients such as binding agents (e.g., pregelatinised maize starch, 
polyvinylpyrrolidone or hydroxypropyl methylcellulose); fillers (e.g., lactose, 
5 microcrystalline cellulose or calcium hydrogen phosphate); lubricants (e.g., magnesium 
stearate, talc or silica); disintegrants (e.g., potato starch or sodium starch glycolate); or 
wetting agents (e.g., sodium lauryl sulphate). The tablets may be coated by methods well 
known in the art. Liquid preparations for oral administration may take the form of, for 
example, solutions, syrups or suspensions, or they may be presented as a dry product for 

1 0 constitution with water or other suitable vehicle before use. Such liquid preparations may be 
prepared by conventional means with pharmaceutically acceptable additives such as 
suspending agents (e.g., sorbitol syrup, cellulose derivatives or hydrogenated edible fats); 
emulsifying agents (e.g., lecithin or acacia); non-aqueous vehicles (e.g., almond oil, oily 
esters, ethyl alcohol or fractionated vegetable oils); and preservatives (e.g., methyl or propyl- 

1 5 p-hydroxybenzoates or sorbic acid). The preparations may also contain buffer salts, 
flavoring, coloring and sweetening agents as appropriate. 

Preparations for oral administration may be suitably formulated to give controlled 
release of the active compound. 

For buccal administration the compositions may take the form of tablets or lozenges 

20 formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present 
invention are conveniently delivered in the form of an aerosol spray presentation from 
pressurized packs or a nebulizer, with the use of a suitable propellant, e.g., 
dichlorodifluoromethane, trichlorofiuoromethane, dichlorotetrafluoroethane, carbon dioxide 

25 or other suitable gas. In the case of a pressurized aerosol, the dosage unit may be determined 
by providing a valve to deliver a metered amount. Capsules and cartridges of e.g. gelatin for 
use in an inhaler or insufflator may be formulated containing a powder mix of the compound 
and a suitable powder base such as lactose or starch. 

The compounds may be formulated for parenteral administration by injection, e.g. , by 

30 bolus injection or continuous infusion. Formulations for injection may be presented in unit 
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dosage form, e.g., in ampules or in multi-dose containers, with an added preservative. The 
compositions 

may take such forms as suspensions, solutions or emulsions in oily or aqueous vehicles, and 
may contain formulatory agents such as suspending, stabilizing and/or dispersing agents. 
Alternatively, the active ingredient may be in powder form for constitution with a suitable 
vehicle, e.g., sterile pyrogen-free water, before use. 

The compounds may also be formulated as compositions for rectal administration 
such as suppositories or retention enemas, e.g. , containing conventional suppository bases 
such as cocoa butter or other glycerides. 

In addition to the formulations described previously, the compounds may also be 
formulated as a depot preparation. Such long acting formulations may be administered by 
implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. 
Thus, for example, the compounds may be formulated with suitable polymeric or 
hydrophobic materials (for example as an emulsion in an acceptable oil) or ion exchange 
resins, or as sparingly soluble derivatives, for example, as a sparingly soluble salt. The 
compositions may, if desired, be presented in a pack or dispenser device which may contain 
one or more unit dosage forms containing the active ingredient. The pack may, for example, 
comprise metal or plastic foil, such as a blister pack. The pack or dispenser device may be 
accompanied by instructions for administration. 

The examples below are provided to illustrate the subject invention. These examples 
are provided by way of illustration and are not included for the purpose of limiting the 
invention in any way whatsoever. 

6. EXAMPLES 

6.1 CONSTRUCTION OF TRAPPED cDNA LIBRARIES 

The GTSs represented in SEQ ID NOS:9-503 were generated using normalized cDNA 
libraries produced as described in U.S. application Ser. No. 60/095,989, filed August 10, 
1998 entitled "Construction of Normalized cDNA Libraries From Animal Cells" (also 
identified as attorney docket no. 8535-021-888), by Nehls et al, the disclosure of which is 
herein incorporated by reference in its entirety. 
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Figure 1 A provides a representative illustration of the retroviral vector used to 
produce the described polynucleotides. In brief, pools of modified human PA-1 
teratocarcinoma cells (e.g., PA-2, PA-1 that has been transfected to express the murine 
ecotropic retrovirus receptor) were typically infected at an m.o.i. between about 0.01 and 

5 about 0.1 (although much higher m.o.i.'s such as 1 to more than 10 could have been used). 
Figure IB schematically shows how the target cell genomic locus is presumably mutated by 
the integration of the retroviral construct into intronic sequences of the cellular gene. The 
integrated retrovirus results in the generation of two chimeric transcripts. As illustrated in 
Figure 1C, the first chimeric transcript is a fusion between the coding region of the resistance 

10 marker (neo was used to produce the presently described GTSs) carried within the transgenic 
construct and the downstream exon(s) from the cellular gene. A mature transcript is 
generated when the indicated splice donor (SD) and splice acceptor (SA) sites are spliced. 
Translation of this fusion transcript produces the protein encoded by the resistance marker 
and allows for selection of gene trapped target cells, although selection is not required to 

1 5 produce the described polynucleotides. 

Another chimeric transcript is shown in Figure 1C. This transcript is a fusion between 
the first exon of the transgenic construct (EXON1- the first exon of the murine btk gene was 
used as the sequence acquisition component for the described GTSs) and downstream exons 
from the cellular genome. Unlike the transcript 

20 encoding the selectable marker exon, the transcript encoding EXON1 is transcribed under the 
control of a vector encoded, and hence exogenously added, promoter (such as the PGK 
promoter), and the corresponding mRNA is generated by splicing between the indicated SD 
and SA sites. The region encoding the sequence acquisition exon (EXON1) has also been 
engineered to incorporate a unique sequence that permits the selective enrichment of the 

25 fusion transcript using molecular biological methods such as, for example, the polymerase 
chain reaction (PCR). These sequences serve as unique primer binding sites for EXON1- 
specific PCR amplification of the transcript and can additionally incorporate one or several 
rare-cutter endonuclease restriction sites to allow site-specific cloning. These features allow 
for the efficient and preferential cloning of transgene expressed fusion transcripts from pools 

30 of target cells relative to the background of cellularly encoded transcripts. 
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Based on the unique sequence present in EXON1, that is schematically indicated as a 
rare-cutter (A) restriction site in Figure IB, selective cloning of the fusion transcript is 
achieved as shown in Figure ID. cDNA was generated by reverse transcribing isolated RNA 
from pools of cells that have undergone independent gene trap events using, for example, 
5 RTT-1 as a deoxyoligonucleotide primer. The 3' end of the RTT-1 primer consisted of a 
homopolymeric stretch of deoxythymidine residues that bound to the polyadenylated end of 
the mRNA. At its 5' end, the oligonucleotide contained a sequence that can serve as a 
binding site for a second and a third primer (GET-2 and GET-2N). In the center, RTT-1 
contains the sequence of a second rare-cutter (B) restriction site. Depending on the size of the 

10 pool and the transcriptional levels of the fusion transcript, second strand synthesis was carried 
out either with deoxyoligonucleotide primer BTK-1 using Klenow polymerase or by a 
polymerase chain reaction (PCR) in the presence of primers BTK-1 and GET-2. 

The second strand reaction products that were generated by PCR were digested with 
restriction endonucleases that recognize their corresponding restriction site (e.g., A and B). 

1 5 Additionally, PCR conditions were suitably modified using a variety of established 

procedures for enhancing the size of the PCR products. Such methods are described, inter 
alia, in U.S. Patent No. 5,556,772, and/or the PanVera (Madison, WI) New Technologies for 
Biomedical Research catalog (1997/98) both of which are herein incorporated by reference. 
Prior to cloning, the PCR cDNA fragments were size-selected using conventional 

20 methods such as, for example, chromatography, gel-electrophoresis, and the like. 

Alternatively or in addition to this size selection, the PCR templates could have been 
previously size selected into separate template pools. 

After digestion with suitable restriction enzymes, and size selection as described 
above, the cleaved cDNAs were directionally cloned into phage vectors (see Figure ID), 

25 although any other cloning vector/vehicle could have been used. Such vectors are generically 
referred to as gene trapped sequence vectors, or "GTS vectors" in Figure ID), preferably 
incorporating a multiple cloning site with restriction sites corresponding to those incorporated 
into the amplified cDNAs (e.g., Sfi I, which allows for directional cloning of the cDNAs). 
After cloning, the resulting phage were handled as a conventional cDNA library using 
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standard procedures. Individual colonies and/or plaques were picked and used to generate 
PCR derived (using the primers indicated below) templates for DN A sequencing reactions. 

A more detailed description of the above follows. The btk gene trap vector was 
introduced into human PA-2 cells using standard techniques. In brief, vector/virus containing 
5 supernatant from GP+E or AM12 packaging cells was added to approximately 50,000 cells 
(at an input ratio between about 0.1 and about 0.1 virus/target cell) for between about 16 to 
about 24 hours, and the cells were subsequently selected with G418 at active concentration of 
about 400 micrograms/ml for about 10 days. Between about 600 and about 3,000 G418 
resistant colonies were subsequently pooled, and subjected to RNA isolation, reverse 

1 0 transcription, PCR, restriction digestion, size selection, and subcloning into lambda phage 
vectors. Individual phage plaques were directly amplified, purified, and sequenced to obtain 
the corresponding GTS. 

When selection is not used, about 1 x 10 6 cells (PA-2, Hela, HepG2, or Jurkatt cells) 
per 100 mm dish were plated and infected with AM12 packaged btk retrovirus at an m.o.i. of 

15 approximately .01 . After a 16 h incubation, the cells were washed in PBS and grown in 

culture media for four days. RNA from each plate was extracted, reverse transcribed, and the 
resulting cDNA was subject to two rounds of PCR, each for 25 cycles. The resulting PCR 
products were digested with Sfi and separated by gel electrophoresis. Six size fractions 
(between about 300 and about 4,000 bp) were recovered and each fraction was ligated into 

20 lambdaGTlOSfi arms, in vitro packaged, and plated for lysis. Individual plaques were picked 
from the plates, subject to an additional round of PCR, and subsequently sequenced to obtain 
the described GTSs. The particulars are described in greater detail below. 

Figure 1 shows the chimeric fusion transcript that is formed when the first exon of the 
transgenic construct (EXON 1- the first exon of the murine btk gene was used as the sequence 

25 acquisition component for the described GTSs) is spliced to downstream exons from the 
cellular genome. Unlike the transcript encoding the selectable marker exon, the transcript 
encoding EXON1 is transcribed under the control of a vector encoded, and hence 
exogenously added, promoter (such as the PGK promoter), and the corresponding mRNA is 
generated by splicing between the indicated SD and SA sites. The region encoding the 

30 sequence acquisition exon (EXON1) has also been engineered to incorporate a unique 
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sequence that permits the selective enrichment of the fusion transcript using molecular 
biological methods such as, for example, the polymerase chain reaction (PCR). These 
sequences serve as unique primer binding sites for EXON1 -specific PCR amplification of the 
transcript and can additionally incorporate one or several rare-cutter endonuclease restriction 

5 sites to allow site-specific cloning. These features allow for the efficient and preferential 
cloning of transgene expressed fusion transcripts from pools of target cells relative to the 
background of cellularly encoded transcripts. 

Based on the unique sequence present in EXON1, that is schematically indicated as a 
rare-cutter (A) restriction site in Figure IB, selective cloning of the fusion transcript is 

10 achieved as shown in Figure ID. cDNA was generated by reverse transcribing isolated RNA 
from pools of cells that have undergone independent gene trap events using, for example, 
RTT-1 as a deoxyoligonucleotide primer. The 3' end of the RTT-1 primer consisted of a 
homopolymeric stretch of deoxythymidine residues that bound to the polyadenylated end of 
the mRNA. At its 5' end, the oligonucleotide contained a sequence that can serve as a 

15 binding site for a second and a third primer (GET-2 and GET-2N). In the center, RTT-1 

contains the sequence of a second rare-cutter (B) restriction site. Depending on the size of the 
pool and the transcriptional levels of the fusion transcript, second strand synthesis was carried 
out either with deoxyoligonucleotide primer BTK-1 using Klenow polymerase or by a 
polymerase chain reaction (PCR) in the presence of primers BTK-1 and GET-2. 

20 The second strand reaction products that were generated by PCR were digested with 

restriction endonucleases that recognize their corresponding restriction site (e.g., A and B). 
Additionally, PCR conditions were suitably modified using a variety of established 
procedures for enhancing the size of the PCR products. Such methods are described, inter 
alia, in U.S. Patent No. 5,556,772, and/or the PanVera (Madison, WI) New Technologies for 

25 Biomedical Research catalog (1997/98) both of which are herein incorporated by reference. 

Prior to cloning, the PCR cDNA fragments were size-selected using conventional 
methods such as, for example, chromatography, gel-electrophoresis, and the like. 
Alternatively or in addition to this size selection, the PCR templates could have been 
previously size selected into separate template pools. 
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After digestion with suitable restriction enzymes, and size selection as described 
above, the cleaved cDNAs were directionaily cloned into phage vectors (see Figure ID), 
although any other cloning vector/vehicle could have been used. Such vectors are generically 
referred to as gene trapped sequence vectors, or "GTS vectors" in Figure ID), preferably 
5 incorporating a multiple cloning site with restriction sites corresponding to those incorporated 
into the amplified cDNAs (e.g., Sfi I, which allows for directional cloning of the cDNAs). 
After cloning, the resulting phage were handled as a conventional cDNA library using 
standard procedures. Individual colonies and/or plaques were picked and used to generate 
PCR derived (using the primers indicated below) templates for DNA sequencing reactions. 

10 Total cell RNA isolation was conducted using RNAzol (Friendswood, TX, 77546) per 

the manufacturer's specifications. An RT premix containing 2X First Strand buffer, lOOmM 
Tris-HCl, pH 8.3, 150mM KC1, 6mM MgCl 2 , 2mM dNTPs, RNAGuard (1.5 units/reaction, 
Pharmacia), 20mM DTT, RTT-1 primer (3pmol/rxn, GenoSys Biotechnologies, sequence: 5' 
tggctaggccccaggataggcctcgctggcclUUUUUUUU 3', SEQ ID NO:l) and Superscript II enzyme 

1 5 (200 units/rxn, Life Technologies) was added. The plate/tube was transferred to a thermal 
cycler for the RT reaction (37° C for 5 min. 42° C for 30 min. and 55° C for 10 min). 

The cDNA was amplified using two distinct, and preferably nested, stages of PCR. 
The PCR premix contained: 1.1X MGBII buffer (74 mM Tris pH 8.8, 18.3mM Ammonium 
Sulfate, 7.4mM MgCl 2 , 5.5mM 2ME, 0.01 1% Gelatin), 11.1% DMSO (Sigma), 1.67mM 

20 TdNTPS, Taq (5 units/rxn), water and primers. The sequences of the first round primers are: 
BTK-1 5' gccatggctccggtaggtccagag 3', SEQ ID NO:2 (GET-2, 5' tggctaggccccaggatag 3', 
SEQ ID NO:3), (about 7 pmol/rxn). The sequences of the second round primers are BTK-4 5' 
gtccagagatggccatagc 3', SEQ ID NO:4 (GET-2N 5' ccaggataggcctcgctg 3', SEQ ID NO:5), 
(used at about 20 pmol/rxn). The outer premix was added to an aliquot of cDNA and run for 

25 20 cycles (94° C for 4*5 sec, 56° C for 60 sec 72° C for 2-4 min). An aliquot of this product 
was added to the inner premix and cycled at the same temperatures 20 times. 

The PCR products of the second amplification series were extracted using 
phenol/chloroform, chloroform, and isopropanol precipitated in the presence of 
glycogen/sodium acetate. After centrifugation, the nucleic acid pellets were washed with 70 

30 percent ethanol and were resuspended in TE, pH 8. After digestion with Sfi I at 55 ° C, the 



79 



CA1 -228897.1 



digested products were loaded onto 0.8% agarose gels and size-selected using DEAE 
membranes as described (Sambrook et al, 1989, supra). Generally, six approximate size- 
fractions (<700 bp, 700-900 bp, 900-1,300 bp, 1,300-1,600 bp, 1,600-2,000 bp, >2,000 bp) 
were separately ligated into GTS vector arms that were engineered to contain the 
5 corresponding Sfi I "A" and "B" specific overhangs (i.e., TAG and GCG, respectively). The 
ligation products were packaged using commercially available lambda packaging extracts 
(Promega), and plated using E. coli strain C600 using conventional procedures (Sambrook et 
al, 1989, supra). Individual plaques were directly picked into 40 microliters of PCR buffer 
and subjected to 35 cycles of PCR [at 94° C for 45 sec, 56° C for 60 sec 72° C for 1-3 min 

10 (depending on the size fraction)] using 12 pmol of the primers SEQ-4, 5' 

tacagtttttcttgtgaagattg 3', SEQ ID NO:6 and SEQ-5, 5' gggtagtccccaccttttg 3', SEQ ID NO:7, 
per PCR reaction. The cloned 3' RACE products were purified using an S3 00 column 
equilibrated in STE essentially as described in Nehls et al, 1993, TIG,P:336-337, and the 
products were recovered by centrifugation at 1 ,200 x g for 5 min. This step removes 

15 unincorporated nucleotides, oligonucleotides, and primer-dimers. The PCR products were 
subsequently applied to a 0.25 ml bed of Sephadex® G-50 (DNA Grade, Pharmacia Biotech 
AB) that was equilibrated in MilliQ H 2 0, and recovered by centrifugation as described above. 
Purified PCR products were quantified by fluorescence using PicoGreen (Molecular Probes, 
Inc., Eugene, OR) as per the manufacturer's instructions. 

20 Dye terminator cycle sequencing reactions with AmpliTaq® FS DNA polymerase 

(Perkin Elmer Applied Biosystems, Foster City, CA) were carried out using 7 pmoles of 
primer (Oligonucleotide BTK-3; 5' tccaagtcctggcatctcac 3', SEQ ID NO:8) and approximately 
30-120 ng of 3' template. Unincorporated dye terminators were removed from the completed 
sequencing reactions using G-50 columns as described above. The reactions were dried 

25 under vacuum, resuspended in loading buffer, and electrophoresed through a 6% Long 

Ranger acrylamide gel (FMC BioProducts, Rockland, ME) on an ABI Prism® 377 with XL 
upgrade as per the manufacturer's instructions. The sequences of the amplicons, or GTSs, are 
described in SEQ ID NOS:9-503. 

All publications and patents mentioned in the above specification are herein 

30 incorporated by reference. Various modifications and variations of the described method and 
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system of the invention will be apparent to those skilled in the art without departing from the 
scope and spirit of the invention. Although the invention has been described in connection 
with specific preferred embodiments, it should be understood that the invention as claimed 
should not be unduly limited to such specific embodiments. Indeed, various modifications of 
the above-described modes for carrying out the invention which are obvious to those skilled 
in the field of molecular biology or related fields are intended to be within the scope of the 
following claims. 
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WHAT IS CLAIMED IS: 



CLAIMS 



1. An oligonucleotide comprising a contiguous stretch of at least about 15 nucleotides 
5 first disclosed in at least one of SEQ ID NOS :9-503. 

2. An isolated cDNA polynucleotide derived from the genome of a human that is 
capable of hybridizing to a sequence first disclosed in at least one of SEQ ID NOS:9-503 
under stringent conditions. 

10 

3. An isolated polynucleotide comprising a contiguous stretch of at least about 60 
nucleotides first disclosed in at least one of SEQ ID NOS:9-503. 

4. The isolated polynucleotide according to Claim 3, wherein said polynucleotide 
1 5 sequence comprises at least one of SEQ ID NOS :9-503 . 

5. An in vitro process for producing a polynucleotide comprising the steps of: 
a) obtaining a polynucleotide template encoding a sequence capable of 

hybridizing to a GTS of SEQ ID NOS:9-503; 
20 b) combining said template with a synthetic oligonucleotide sequence of about 1 4 

to about 80 bases in length that comprises a contiguous sequence of at least 
about 12 nucleotides disclosed in one of SEQ ID NOS:9-503; and 
c) processing the combined oligonucleotide and template preparation such that 
said oligonucleotide sequence hybridizes to said template in the presence of a 
25 DNA polymerase molecule and a sufficient concentration of dNTPs for said 

oligonucleotide sequence to prime DNA synthesis by said polymerase, 
wherein a polynucleotide is produced that encodes at least about 50 contiguous 
nucleotides first disclosed in one of SEQ ID NOS:9-503. 

30 6. The process of Claim 5 wherein said template is mammalian cDNA. 
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The process of Claim 5 wherein said template is mammalian genomic DNA. 
The process according to Claim 6 wherein said templates are of human origin. 
The process according to Claim 7 wherein said templates are of human origin. 
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ABSTRACT 

Novel human polynucleotides are disclosed that correspond to human gene trapped 
sequences, or GTSs. The disclosed GTSs are useful for gene discovery and as markers for, 
inter alia, gene expression analysis, forensic analysis, and determining the genetic basis of 
human disease. 
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SIGNATURE OF BRIAN ZAMBROWICZ (202) 



SEQUENCE LISTING 



<110> Nehls, Michael 

Zambrowicz, Brian 
Sands, Arthur T. 

<120> Novel Human Polynucleotides and the Polypeptides 
Encoded Thereby 

<130> 008535-0027-999 

<160> 503 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 40 
<212> DNA 
<213> Synthetic 

<400> 1 

tggctaggcc ccaggatagg cctcgctggc cttttttttt 

<210> 2 
<211> 24 
<212> DNA 
<213> Synthetic 

<400> 2 
gccatggctc cggtaggtcc agag 

<210> 3 
<211> 19 
<212> DNA 

<213> Rattus Norvegicus 

<400> 3 
tggctaggcc ccaggatag 

<210> 4 
<211> 19 
<212> DNA 
<213> Synthetic 

<400> 4 
gtccagagat ggccatagc 

<210> 5 
<211> 18 
<212> DNA 
<213> Synthetic 

<400> 5 
ccaggatagg cctcgctg 

<210> 6 
<211> 23 
<212> DNA 

<213> Bacteria Phage Lambda 



tacagttttt cttgtgaaga ttg 23 

<210> 7 
<211> 19 
<212> DNA 

<213> Bacteria Phage Lambda 
<400> 7 

gggtagtccc caccttttg 19 

<210> 8 

<211> 20 

<212> DNA 

<213> Mus Musculus 

<400> 8 

tccaagtcct ggcatctcac 20 

<210> 9 

<211> 277 

<212> DNA 

<213> Homo sapiens 

<400> 9 

gtgttgtgct gatgcaggag acaaccgcga agatggggac agaatcagta acatcgacgt 60 

aagggaattg aagcagaaga tcacgctgcc tgcagacacc aggaaacgcc aagacccccc 120 

ttccacgaac caacattctt ccaccctctc caactttttt ctggaacccc ttcacttcca 180 

accgccactc aatgtacact tcactttctc gtgctcttcc taagagagta gtgttttctt 24 0 

cctccccacc gagaaaaaaa ataaaagcaa caactgg 277 

<210> 10 

<211> 434 

<212> DNA 

<213> Homo sapiens 

<400> 10 

cgtcatgttc ctgcaaagag aaaaataagg aaaaaatctg caaaacattg aagactcatg 60 

acccacttta aaaacataac tggatacatc acatgaactc aagaccatga ctatggagga 120 

agatttaaca cttggcaact cttacaacaa caacaacagc aacagggaaa aacaacaaca 180 

acaacaaccg aagagtgcaa aaagaactaa tgcattctct aggtaagcct ggatggagcc 240 

tctaagacct aacaggatgt ctgagattcc agggaagtgg cctgtgatct gtcagtaaac 300 

aaataagaag ctaatacagc tttgttgtgt tttctgattg gcatggttct tgaactatct 360 

cctacttgta gttgcagaca aagaaacagg agatgaatta ccatgttcta ggactttgtg 420 

ttcctttcca attc 434 

<210> 11 

<211> 407 

<212> DNA 

<213> Homo sapiens 

<400> 11 

gttcacaaca gtgttatggc gggagcaggg aggcacctac atccattgga cccatcctga 60 

cagctgggaa ggatgtgtcc agccacccag ggatgtgcat ctggcaccca cctcacaaca 120 

gctgttctaa ccacgtaaga agcacaaggg tcaccggtta ctctccatga gaacaaaagg 180 

ccaaggatgc agagataatt gcatcaaagg gattcaactt cctggatgac ctcattccaa 240 

agatctgcag agcccagata agcatcccag ggttctggca gagggcccct ccagggacag 300 

gaaggggaca ggaagccggc tttccgtgtc tgtaccgcct tccttgggaa ggataggaca 360 

cctgtggcca tcaagtcatg atgccccatc tttctgaaac gaaaaca 4 07 

<210> 12 
<211> 200 
<212> DNA 
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<213> Homo sapiens 



<400> 12 

gaggagaact ggtggcttta taagaagagg aagagagacc aaagcatagc atgtcagcat 60 

gcccagtccc ctctccacgc tataccctgt gccacctcca gacacttcag agaccaggaa 120 

taaggccctc accagaagtg ccccctcaat cttggacttc ctatcctcca tggctgtaag 180 

gaataaattc cttttctttc 200 

<210> 13 

<211> 128 

<212> DNA 

<213> Homo sapiens 



<400> 13 

atgaaggaaa agagggagaa gaaaccagct gcctggaaga ctgaccctct gagatgctct 60 

ggagccgtgc agttgttctc actggcagat cagtcctgtc cctccaataa aagagagggt 120 

gatcttgg 12 8 

<210> 14 

<211> 142 

<212> DNA 

<213> Homo sapiens 



<400> 14 

ctgaaagcaa agaactcttt agatagtgga gtcacactgg aaaaagcaca gacccttgag 60 

tgtactgctt ggaggagagc taccctggag catttgctcc agattctgca tgagcaaaaa 120 

ataaactttt gctgcataaa gt ^42 

<210> 15 

<211> 149 

<212> DNA 

<213> Homo sapiens 



<400> 15 

acacttaatc tggtgttcct gaggctgacc tattggaata tcttgctgaa gaccacgtat 60 

acaagatgtg aacattcatc attatgaggc tgaatgtaaa atacttcatt ttataatgaa 12 0 

gaaagtcagt aaaacaattt ccagcccag 149 

<210> 16 

<211> 166 

<212> DNA 

<213> Homo sapiens 



<400> 16 

gaagaagaan ctcncctcnn catgagaccg ctgtggggat ctggcactgt ggttcctgna 60 

tgcaaacant ggtctggncg tgcctgggcn gacaataccc ctttccgtgt cncgggaaan 120 

gcccncctta aaaaaactga nggngttgaa aaaccagtaa accctc 166 

<210> 17 

<211> 113 

<212> DNA 

<213> Homo sapiens 



<400> 17 

accctgatna ngagaccagc tgaggcgaat tatgagtcaa ctaaaattat ccaaaagatc 60 
attttaccgt aaagtagttg ctgaatgtac acgaaatgtt tagaaattaa att 113 

<210> 18 

<211> 250 

<212> DNA 

<213> Homo sapiens 
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<400> 18 

cttctnctga agaatgagaa cacttgccag ccctttgcct atgttatcac ctggaataaa 60 

ctggatgtgt ctnaatggaa cctgcctcct ttggggagcg catactcccg ccaggtcacc 120 

acagccacca tgaccacctc atgcctccca tccaccfcgtt tcattaattt gtgcctggac 180 

cattttcagt tttctggatg acatgggtga ggaggaggaa actcaggtaa atgataaagt 240 

ttcgactatc 250 

<210> 19 

<211> 387 

<212> DNA 

<213> Homo sapiens 



<400> 19 

aagacagctg aatggttcca gtctttcagt cctgctcctg gccaacactg gacctctcaa 60 

agtctagcca actcctcttc cagcgccttg ataaacaacc ccctcatgct gggaaccaca 120 

gcagtgggct gtttttctcc ctcatgcacc ccaggaagcc tctcctcttt gcctgggctt 180 

tcttcccaag gccttagctg ccaacccatt ttacacccat gcgaagccca gtcagtcacc 240 

tgaagaaaag gagactcaca gaaggcccaa gatgaaagac tctttaatcc tgtggctttt 300 

tgagttttgt ttttagcagg aagaccttat tttcaaaaca aattgttaca cagaatttgc 3 60 

cagtttacag aacagatgaa taaagac 3g 7 

<210> 20 

<211> 216 

<212> DNA 

<213> Homo sapiens 



<400> 20 

gcctaactgn tncaggagtg tctgcttgca tggacaccat tgtggaaacc ttcctccgca 60 

cctgtgccag gctcttgtgg atgccatcaa caaacccctc tgacacctct gacgggagca 120 

tgtgaataac accgaataat cacaacaaat cctcctcatc ataaagcctt gcgnggactg 180 

gcactcgcaa atatttaaat aantattaaa acactg 216 

<210> 21 

<211> 541 

<212> DNA 

<213> Homo sapiens 



<400> 21 

ngtaatnnag gnggangccc cctggtgagg gaactgacca gcagactcca gcagctgtgg 60 

gaaaactcta ctgatgacag gcaagaagcc agactgctca gacctagagc tataaggaaa 12 0 

cctgagtaag ctcgggatga agttatcccc aatcaaccca ccaggtgatt ctgaagccaa 180 

taatttggtc cttggaagtt tgtgctgtat ggaaaaaaat cacccttctt ggctgacatc 240 

tgttttgctg gtaacacaaa tgcaacttat taatcatctc tgggtaagca agaaatgtaa 3 00 

tcctgaaaat ggcttacaag agaaaatctt ggaagataag accgtaacac taaaacgcct 3 60 

ctccagatgc cttaggaaca tccccaagca gtaacagata aagtccctcc ataggattct 420 

tggctatgtt taagtttctc atagaaaaaa ataaaataac naaacncnaa aaaaaaaagg 480 

gcccgngggg ccaattcagn ttggacttaa ccaggctgaa ctngttaaaa aggggggggg 540 

g 541 

<210> 22 

<211> 492 

<212> DNA 

<213> Homo sapiens 

<400> 22 

gacgtctggg gagctcctgc nttaagtnaa acnngaggtt ttngtnngcc cccagnaaan 60 

nngantcggc canacccnaa aaaatcccan cctcaccaag agatgacacg tgacctggtg 120 

ggcctcaccc agggcataca gctttcccag ctagcaaaca aacaagccct ggtcacagcg 180 

gttatagctg gctcatggtc gctcacagac actctgggca tgcattcccg tgacttanaa 240 

aagaggaggc ctttggaacc tgccagtgct gtctgctgat tgtgaggtgt ctggaacctg 300 

gggccccatg gcccctccac accagcatgg tgctctgcaa aggccagctg ctcttcatcc 360 

tgtctcaatg atacacagtt tttttcccca aaactttagt agcgccactc tccctatcac 420 
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tcgtctttta attttgcccc ttattgntcc ttanattaaa aaatatcctc ctttcatngg 
agggttggac ct 



480 
492 



<210> 23 

<211> 273 

<212> DNA 

<213> Homo sapiens 

<400> 23 

gctctgagtc aatacaagta gggaagttca actggttccc tgggtgttca ttcctggttg 60 

gagagctgtt tgggaggctg ggaaggtcca ttagaagcat aattctattc cagaggtggc 120 

ttggcagatg gagcatatca tgggttaatt tctcagcatg tcacagaaag caattcctac 180 

tagacctgaa gaaagtggct tctctcttaa cagaatgtta tctttttcta gagagtaata 240 

tgtttttatt aaataaaaag catctaatag tac 273 

<210> 24 

<211> 495 

<212> DNA 

<213> Homo sapiens 

<400> 24 

attgcaagcc cccacctatg ttggttaatt ctgcttcaca tggaagagac agccattggg 60 

ccagccctga acaaagatcc ctgtcaccaa gatccactgc tcctgctgtg gtcaggcaaa 120 

gagaaggtta tgtctcctga gttctagttc tccgtcctga agtccatgta atgtgagtta 180 

caagccgtct gcagaggtga gcattcgact ctggccagct caagttattc ggcaaggtgt 240 

gattgtccag tcttgaggct gtttgctggg agaagcacga cataggctat tgccagtgcc 300 

aaggagaaca atcctaataa gactgacagc cctgcccaaa tgacatggca ttgaaaatga 3 60 

cacctgactg aatgaanctg acccttgagg taggcacttg ancttnttca aaaaanaagg 420 

gagggaccag ccncaganga ggcatggatc caaacttttg ggatcctcan aaatgtgtga 480 

agtgactcct tcttt 495 

<210> 25 

<211> 468 

<212> DNA 

<213> Homo sapiens 

<400> 25 

attttcctgt agagttagga aactgacaac tagaagacat aaatatctgt tccaactggc 60 

tgctgtactt ctgtgtatga ataaattaat gttctgtttg aaacatcagt ctaagggaga 120 

agagaatgta catgcagata gcctttctat cgacctctat aaccaagacg gcaagcttta 180 

tgaaggagga gatgctgtct catttacaag agccaaaagc agtgttccct aactcttggc 240 

tgagggattt gccatgcagg ataactcata tactatcatg tccttagaga agacatcata 300 

ttcatttgtg ttttctcgga gtaaatttta gtgccgtgat accatttggg tattcattaa 360 

tatttatcac acnaaggaat taaatgggtc tcccgaacct ggcnttaacc tccttgctaa 420 

cctaatattc attcaacaaa tattaactgg gcatcttcaa tggggcag 468 

<210> 26 

<211> 176 

<212> DNA 

<213> Homo sapiens 

<400> 26 

gatcatgaat ggaatgacac actctgaacc gaagagacct tacagatcat ctagttctcc 60 

agccttgaag atggggaaac tgaggctcaa ggaaggcatg taaacagcaa cctcgggatt 12 0 

ccatttaaat tctgcctctc tggatctgct tcctgatata taaaatggta ataacc 176 

<210> 27 

<211> 104 

<212> DNA 

<213> Homo sapiens 

<400> 27 
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actggcatga aatgacagat atacagagga cccttgaaca acctgggttt gaactcctca 60 
acatggacac ttatacacgg atttttctca ataaaagtga cacc 104 

<210> 28 
<211> 472 
<212> DNA 
<213> Homo sapiens 

<400> 28 

gggggggctt ccttncttta gttccgaact gggggggagg 
gtgggtttgn ggaacttggc agcccntttt ttttaccaac 
tncaaaaaca tggaccttna ttgnggccnc ccnttttnct 
ggggccnttg gaccttaaag gnactaaaat ggncaagggg 
agtttgnccn ngtccccacc aggttttttg ntttttaaaa 
aaaaaatctt tccttcaaaa agaccaaaaa ancncgattg 
ccnttttggg gtttaaaaat tttaaaaacc aggnaggacc 
ttcaaggggt tcaaaataaa ataaaaaccn atttccttag 

<210> 29 
<211> 443 
<212> DNA 
<213> Homo sapiens 

<400> 29 

atctcactga agagttcttc tgtgcctgga 
tttcaatggt tctgttgaac atgcaattct 
ggactcgggg gcaacgatgc aattggaaga 
gcatatgaat tatccaaagt gtgggaagat 
aacccaagta catttaatgc tgcgaagccc 
gaagacattt aggacagttc atgtcactct 
caaactntaa agagagctta tgctccccaa 
ttctgaggtg ggactggagg ggt 

<210> 30 
<211> 254 
<212> DNA 
<213> Homo sapiens 

<400> 30 

tctctcctct ggatctgagc taaaagaatt cctgccttac 
gtgggtagaa gatcctgaag ttggtccttg ctccttttca 
ccctctttcc tggctagtgc attacaggca cactaaatat 
gaaattacct tttcctaata tttcctatag gtaattatta 
tgcaaattaa aaag 

<210> 31 
<211> 120 
<212> DNA 
<213> Homo sapiens 

<400> 31 

aatatataac tcgagctcgt gttcctgtcc caggagagag agatgaccct cttcttggtg 60 
ctttcccact ttagttttca tcttccataa tttacgaata aatgcataaa atggaaatgg 120 

<210> 32 

<211> 124 

<212> DNA 

<213> Homo sapiens 

<400> 32 

atctcggaga gaaacgcatc tatcagattt ttactgatac cgaggaagaa gtatctccct 60 
cttcgaattg tattgtacat ttgcattgat gtggttattt tcatctaaat aaagtcaaac 120 



aaacccccan aanttaaggg 60 

taaataaaaa aatctggtat 120 

tnattaaaaa aaccaaaagg 180 

gtggggacca anaaatccaa 240 

taaaccccaa atttgggnca 300 

aaagggggga aaaaatggcc 360 

tncccctttt gngtcctttt 420 

tggattttaa gg 472 



agacttattt tcagtctgag 
cactgaaagc accagatttc 
actgcaccga aaatgacgat 
gcgcccccac tggagtacgc 
cgagtgaggc aaaggtgtct 
gcacagatgc actgaaattg 
atctgtttcc gagccaggta 



aagaatgatt 60 

cgcgtaggag 120 

gtcttctcat 180 

tgaagccttt 240 

ttttatttta 300 

attgnggggg 3 60 

ggatgatgaa 420 
443 



tggaaaaaga gtacagcaga 60 
gaccccaacg ntctcagtct 120 
tgttggtggt gatgatgaca 180 
gaaaattaaa agtagccact 240 
254 
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aggt 



124 



<210> 33 
<211> 373 
<212> DNA 

<213> Homo sapiens 
<400> 33 

gtggggtctt tcaagatgaa atcagagtaa ccccatggag gtcctgagtc acggtggcac 60 
cttgccctgc ttgcctaaca aagacctcct gggaggagga cccagaagag ggcagggctg 120 
aagaagagtc acagctgaag aatgtgactg tttgccagga aagccacttt ctttctgcag 180 
caggattaga attcctacaa ctccagccaa aggaactggg ttgggaagcg atactgcaag 240 
cattcatgtg cttccatcct ggtcttcagc ttagccacgg tcctgcgggg acagtgagtc 300 
cctctctgag tggccaggac ctncacctgg cccacaggaa gcctttacca gcaggaagcg 360 
aaacgggatg ggg 373 

<210> 34 

<211> 480 

<212> DNA 

<213> Homo sapiens 

<400> 34 

tgtcattgag gagaatttgc ctaggagatg caaagagaga gaagcccata ctttgagggt 60 

ggaagcccct ccaacaggca acatgactgc agcacaatca actatggctt tgctgatctc 120 

gtgtatcatc atcctcatca tcctcatccc cgcaattgca gcaaacgtcc agttgtgcac 180 

ttgctgctga tgatgaataa atgtatagaa caggaaaaaa tgtatctcac cttcagacag 240 

aagatctctg ccatcatgtg agagagagcc tgagttagcc tgctggatgg tcaaagatga 300 

gtggtgcagc taagtgaaag cctgctgact tgtagacata tgagtaaggc catgcttgat 360 

cacctggctg ccagctggcc tgccaactaa ttggaggnac ttggaaagan tcnacnaaan 420 

atcacccccc caggtcaaat aaaccccagc cccctccntg agaatgatga actaaataat 480 

<210> 35 

<211> 100 

<212> DNA 

<213> Homo sapiens 

<400> 35 

aaagatgaca gaagaacaaa gatgaaggag gaggccactg gtttacagga agggtaaagg 60 
acaacgacta tccagatttt tcttccaact ttactttaag 100 

<210> 36 

<211> 183 

<212> DNA 

<213> Homo sapiens 

<400> 36 

gcagcaacca cggtcgtaat gggatctgtg actgtcacca gaagaaatca ccaacagttt 60 

cgtatcacgt gagagttttg caggtgcctc caaatgccgt ccatgctcat caacactgtg 120 

acatcagctg cggttcttta atgcatgtga taaggaagca cgtatattag aagtttgggt 180 

ttt 183 

<210> 37 

<211> 144 

<212> DNA 

<213> Homo sapiens 

<400> 37 

aaaggacttg tacctcccag aagttcacgg aagtgctcag gacaacagaa tattgtgagg 60 

ccaacacagc aaacagagca acgatgagca gccacttttg actttggttt ccttattcag 12 0 

gaaataaaac agatgatctg acag 144 

<210> 38 
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<211> 140 
<212> DNA 

<213> Homo sapiens 
<400> 38 

gatctgtaga gagacagcgg aggcaaagat acctggagcc gatcanagaa gagatgccca 60 
ctctgaaatg gacacgccta aggagacatc aaaatcttca ccaaaccctg tctaataata 120 
cagttaaatc aatatcagag 140 

<210> 39 

<211> 442 

<212> DNA 

<213> Homo sapiens 

<400> 39 

gagagaatct aatatactca ttcacactga ggtgtaaggc tctaagaaga tgtacactgc 60 

ctgcccaggg atatatccag ttcacctgga agctaagcaa gaagaattaa aatacagaaa 120 

tgggaataaa gtttgcaacc tctccaaccc tttgttccag gctgcttttt acgcctcaaa 180 

acttaccaga ttttgtctgc acctcccaga caacctcaga aatgtgtttc ccaaaaatct 240 

ctttccctgg tcagtttctc tgtcatgcac tactttcaga aacccagact atcctctggc 300 

ccatcagccc tcatgcccag agacccatgc caagttaaag ttgntcattg ggcancagat 360 

atgtctccaa ggcaccttct aaatctgtca aggccaattt aggaacagaa ggttgaggcc 420 

agatgggaaa agttgggaaa ca 442 

<210> 40 

<211> 414 

<212> DNA 

<213> Homo sapiens 

<400> 40 

gaaacagaat gtctgtggtc angaagttcc ttcfctgggac taaaccagtt gaagctggca 60 

aaatccatga tggcagctta ctcgatcttt gaagaacctc tagcttcatt atactccaac 120 

ttccatacta aatgacactc ccaccaatgc catgacagtt gacaatcatc atgacagtga 180 

ccaaaaagaa ccaaaaaagg acaggaaaga agtggctact tgattccagg aaaatctcca 240 

tcctttccca agaaaagcat gaatattcct ctccttggtt ttaacgctca aacctttcat 300 

taaagatacc ttgtgtctgt aacttcctga ttctcaggag ctgacatgtt gatgtgtgag 360 

ccacactccc acttctcatg tcatgaccat cgaataaaaa ctggtcttgg tttt 414 

<210> 41 
<211> 271 
<212> DNA 

<213> Homo sapiens 
<400> 41 

actttgatgt cttcaaagca aggcgagtga gtggcactct tcagacaaga aggaagatgg 60 
caggtgaaat catcttcttc taatgagccc tgtgctatgc ttgctgatgt ccttggtcac 120 
ggagattttc agaaaagcca tggccttacc agtgaaggtt acacagaggc cactggagtc 180 
aagtaattca ttgctcctta ttacatttag gcacttcttt atccatcatg caggctattg 240 
ggattaaaat gggtcctttc aacaatgagt c 271 

<210> 42 
<211> 111 
<212> DNA 

<213> Homo sapiens 
<400> 42 

ggataactac tggatcagca gtactccaga cagtgcttca ccagactggg tccctggatg 60 
atgaaagagt cccccctgca gtaccacaat aaaaatgtag tgtgaatgag g 111 

<210> 43 
<211> 473 
<212> DNA 



CAl -201292.1 



<213> Homo sapiens 



<400> 43 

aaaccgagac agtacccact gccagcagca gatgggaagt ctaaacagga gagactgaat 60 

aaagctgaca actgaggcag gataaagaag agaaggaaca aagaaggagg gggcaggaaa 120 

agaagccaag cagaacatgc tagcctgtcg attttgtctt ccattaaggc ttcagcagaa 180 

gataagaaaa gctaagccac gtcagtgaag <jgaggacagc aggaaggctt tcaggggaag 240 

atttgtggtg tggattcact cggcattgat gagagcagct ccccagacag ataccgagaa 3 00 

tgaaaaacca aaccagtgac caggaagaga agatatgaag aaaaatataa gtacatcttt 360 

tattgtaaaa atgaataact ataggctata gactggatnn gggaanccta atccctaatn 42 0 

gngatggaat tgggaggngg ggctttggga tgccattatt taataggtca aga 473 

<210> 44 
<211> 429 
<212> DNA 
<213> Homo sapiens 

<400> 44 

gtggggtctt tcacagtcac cagcatcaaa 
tacagaaacc tgagcgggtc anaacgttca 
aaaccaagga cagacagntg tgagagcaag 
agtctggtag agcaaaatga ttttctcctt 
caagacatcc ttacacatct gctgtgtgca 
gcttcttcaa aacaaagtaa atgaaaattt 
atgaaacaat gagaagtacg aaataaatct 
taaaaatcc 



ggagcagtag tggcagcaga 
tcttcatcta gccaaggtga 
ctggcagcaa agggctgagc 
cagcaatgtc agaagaacca 
aaacctgcac acaggacgtg 
cagtggctcc agcagtcggg 
tatatgcttt tttataattt 



gtctcaaccc 60 

aagcacccag 120 

tctgaatttc 180 

tcccttattt 240 

gttctgaact 3 00 

ac tgt taggc 3 60 

agtaacccat 420 
429 



<210> 45 
<211> 489 
<212> DNA 
<213> Homo 



sapiens 



<400> 45 

gagcatatcc tccgttggaa ggaagaaaga agacaaacag cagcctgcat gcttttgaag 60 

ctggactatc aacaaaggat cttctcaatc aattcaccac tagcaacaga atgcaggcgg 12 0 

ttctcagaaa tggctcacaa agaaacacaa aaaaaggntg tctgaangna aaancnagaa 180 

aaggttccct tcnnnaaaan gnaaatggan cnttnancnt ttttngggnn gcagaagtgc 240 

cacggncntn tnantgcggg taattnaaan agggncanaa cactttcttc aggccacccn 300 

agggangttt tatattnccc atataaagan acaaattccc acantgtgcc ttccttgngg 3 60 

tntntccaac tctttgccaa caagaggcca acccggggng ggcccccncc aggggaaaaa 42 0 

aaccttttgg ggngganccc cctttgggca ntgccaangn ccttttgaca tttcaccggc 480 

gggaagaga 489 



<210> 46 
<211> 358 
<212> DNA 
<213> Homo 



sapiens 



<400> 46 

ggatttcaga cnaaattcag ggattcttcc ccncccaaga ctgtggttca gaccacggtg 60 

acgtcttcca ggcaccagga agaaatacga ccaacctccg taacaaatga gagaaacttc 120 

acctgactgt gttttgtgca tttggnttat gagncgtttt aaaaacgtgt acttttactg 180 

ctgcgttcag gttttcagcc atagaatatt ctagaaaaaa atagtgataa catttatttc 240 

accgctataa ccctgaatgt gtagctgtgt tttttaaaaa aacatttttt tacaattgta 300 

gaatatgtaa catgcctcca gaaacgtxjcc ctaaacacaa atatataatt tggcaaat 358 

<210> 47 

<211> 177 

<212> DNA 

<213> Homo sapiens 



<400> 47 
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gaaaagctga agatggtcag acctggtggc acacacctgt aatgccagca cctttgtgag 60 
gccaaggcag gtggatcgct tgagcccagg aattcaagac aggcctgggc aacacagcaa 12 0 

gaccttgtct ctataaaaaa ataaaaaata aaaataaaaa taaaaaaaag atcagtc 177 

<210> 48 

<211> 536 

<212> DNA 

<213> Homo sapiens 

<400> 48 

gacgtctggg gagctcctgc nntanntnac actctgnnag aacccatggc tcatgaatca 60 

cccctttggc ccaaggatga gtacccacag cagcaagctc ttccattgga aaccacgctg 120 

aggaagacat ggtcaagctc tggcagcaga tcaagctgtt atggcaagaa ttcctggttc 180 

tgcgtcccca gcatgtaata tagaagatct gggagtgggg tcttgggtct gtaatgtctg 240 

tgatatggct cctcacatct tcttgtgtag agtgtcatgg ccaaaacagg aataaccgtg 300 

tttgcccttc tgaattcccc agtaatgagt ctgaagctag tctgaagcta ccacagtcta 360 

ttttaaggga ttccataaca tgtttgaatt atatctatat ggnagggact ttcaatcagt 420 

agccaagatc tgntactaaa attaaatncn caatttaatt tccacaagct acatacctcc 480 

cttcanaggc ctgccaaaat tnttaatgga ggacaatgaa agttcgtaac cttctt 536 

<210> 49 
<211> 374 
<212> DNA 
<213> Homo sapiens 

<400> 49 

gtgaggaact gaaattgagc acttgaatgc 
attgtcctct caaagccggt tcacttgttc 
ttcatgtact gggagcacca ctcctgaaga 
gtctgactag ttaggtggaa catgggatct 
gaatgcatgg gcaaatgatc tctggagact 
cagtaaacaa acagcacact cagtaaccag 
cattctgtga ctag 

<210> 50 
<211> 595 
<212> DNA 
<213> Homo sapiens 

<400> 50 

aggaaaggcc acatgaagac acacctagaa tgtgcccgtc tgcagccaag 
tcaccagaaa ccaaccccta ctggcacctt aatcttggac ttccagtctc 
gcagtagaaa tgaggccatg tgactctcca cgctggagga ggacaggcac 
gccagctcgc tcttgcttgt gtgatgcctg cccttggaac ccagccaccg 
aagccaagca gccacgtgga aaggccatta caggtgttcc agccacagtt 
tcccagctaa tagctggcat cagctgccag acatcacacg gtgagggaga 
ttctagcctc cgcccctgga tgctccaact ttgaaccagc ccacctcact 
agagagaatt gagtattatt gctgaactct gcccaaagtg cagtttgtat 
tcttccctta ttttaaagtg ataacttttt ggagagactt ttttacacaa 
atggaacaaa tactacttat gatttttgca gagtaaatcg gcttctcgct 

<210> 51 
<211> 268 
<212> DNA 

<213> Homo sapiens - 
<400> 51 

gagattttca aacctcagta tgactgaaaa tatacttcag aaagtcaaga cctgggccta 60 
ggagtctgca ttaaaaacac tactctgggt agagataaag aaagggactc tctgagatga 12 0 

gggaaaagca gtggtttcta atctgtggtc cagagatctc tgtcgggatg aagaatatgg 180 
agggagaaac aagagttatt gtaaagggtc tacaaagctc tacgtatgca aagcactatc 240 
tatagactga ataaataagt cttgcact 268 



ctggaaccac atatccaacc 
tcaagacact ttatgtcgag 
agctgactca gcttcaatgc 
gtaaagcatg gtgctgtgcg 
ctagcaatca ttccgaagtc 
tattcttgta aagatggagg 



aatggcagcc 60 

ccacagctac 120 

aaggaagaaa 180 

agaggtggtg 240 

tgtgttcaag 300 

atggtaatta 360 



aagaaaggcc 60 

cagaactgat 120 

tgaggcttcc 180 

taccgtgagg 240 

ctcatggagg 300 

ctgcacaaga 360 

tgagtgccgc 420 

gcaaaatact 480 

caagtagata 540 

tttcc 595 
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<210> 52 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<400> 52 

atatttcgct ctgaagaaac atcattagaa ataaataaat aaaattaaca tataatacct 60 

<210> 53 
<211> 419 
<212> DNA 

<213> Homo sapiens 
<400> 53 

tctcaatacc ttcacagagg tgaagaagca gcaaccaaat gaattagaca gcaacatgat 60 

tcctagagaa tggcaagacc aattcttcaa ctacttcttc agcatttctg aaacatatgg 120 

aagatggccc attgtgctct cttaattctt tgataatctg gacattgact tttccattat 180 

atgacctggg cttgtgggca tcatgtcata atgcacctgt tcagacatct ccctgtacca 240 

atatggatca cttgaagaga ctcctttgcc tccatcaaaa aggatacagn tgtgtatctc 3 00 

ttccattttt gnttacagng cctaaaatta tttgagcagg ttttcacctc ttctctgaat 3 60 

aaacacctta ttagtcctta aaangaaang aaaaagggaa aataaaactt ttaaatgca 419 

<210> 54 
<211> 450 
<212> DNA 

<213> Homo sapiens 
<400> 54 

ggncgaggca gaaccaaacc atggatacgg gtcctttgct caaattcttc tcaatgaaga 60 

ctctgtgatg aagaggccac ttccatttaa aggcagcgac acttagaaaa tcacaggcat 120 

taaaacttag aagaggtcac cttatccaac gtcccagcca gcacagccat cctttcacag 180 

catccatgac attcagcctc ctctcagaca tgggaagatc acctcttcat gaaacagcag 240 

attcttcaag gataaggaaa tggaggaaca aagcagtgaa gtaatctgtc caaagcccaa 3 00 

aagttgaatt gttgaaactg acatctgaaa gcaagtagcc tggcttcaga gtatatgctt 3 60 

ttaatcgctg tgttatatac tgcctcttta tatgtgataa tatagtatat ttattaagtt 420 

attaaaagaa acataagttt ctttgttgtc 450 

<210> 55 
<211> 172 
<212> DNA 

<213> Homo sapiens 
<400> 55 

ggactaagga ccactaacaa cagatccaag aacacatgta atgcaaacca ggtattcata 60 
tgcctctgac attttcaagc ctaaagatca agagccatca tcttttacaa gagttgcagt 120 
ttggtcttaa cctccaaaaa agaaacttct aataaatact atttccttct gt 172 

<210> 56 

<211> 211 

<212> DNA 

<213> Homo sapiens 

<400> 56 

agagtttgtt gctaaacatt taccagcaca ccctaaagag aggagaaaaa aatatgtgaa 60 

gaaaaagaaa aaaggagaaa tcaaagaaag agaaagcaaa aagagcatat ttggatgtgg 120 

aagaagaaaa agacaagttg aactgtctta aattccagcc catgaaagcc ttcctttttt 180 

taaataaagt ttttgttttg ttttggtctg g 211 

<210> 57 
<211> 328 
<212> DNA 

<213> Homo sapiens 
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<400> 57 

taccatggtg tnttgaatnc agcttngctt 

aatttcaaat ttccagcctc cagaactgct 

agaggaggat atttgagaga aggggaacta 

tacatagaag atgatcaata acacttattc 

gttctctctc gggagtgctg ttttctttcc 

agcgataaat aaagaggaaa agaattcc 

<210> 58 
<211> 208 
<212> DNA 

<213> Homo sapiens 
<400> 58 

gagttggttg ttaaaaagag cctggaatct ccccgtctct ctctggcttc ctctctcact 60 
catgtgatat ctgcacttgg aggctcctct tctctttctg ccatgaatga aagcagcttg 120 
agaccctcac cagatacaga tgctggtgcc atgctctctg tacagcctgt agaccatgag 180 
ccaaataaac ctgttttctt cacaaatt 208 

<210> 59 

<211> 334 

<212> DNA 

<213> Homo sapiens 

<400> 59 

catatctcaa aaatcaagat gaanccttaa gctttctacc cagatgttgt gggaacttga 60 

agacaaagtc tcaaagagac tccgttttgg tcaacaatta gcccttccac atttggatcc 120 

tgggccacat gtggaaataa agagttccag aagaattctc ccatgaaggc attggaatgc 180 

ttcaatacat agttttgtgc caaatctaca ataatcttcc caaaagaaag actcttcagt 240 

gttctggatt tttcgggact tntcttattt tcttgtgcaa catcttaaca caaactagaa 3 00 

taaagatgac atataatcat ctgcattcat gaat 334 

<210> 60 

<211> 177 

<212> DNA 

<213> Homo sapiens 

<400> 60 

aaagctggtc gttaaacatt tactaaaaca ccactggata caagtgacat catacaagat 60 

ccagtccctg caaccactga tctgcctcct ccctctatgg cgtcacctgt ttggaacatt 120 

tcatgtaaat ggaaccatac aagatgtgac cttttgtgac tggcttctct cacttgg 177 

<210> 61 

<211> 381 

<212> DNA 

<213> Homo sapiens 

<400> 61 

ctgcaatgtt cctagagaga agccagcact cgccagatct ttggccaccc cgaggtgtcg 60 

tgtgcataag ggaagatgag aggctggttg acgcccaccc ttcaccagtt ttgtaaataa 120 

caagctggcg ccccagaacc catccacagc agctttttca gtggcattat gcattcgtgg 180 

tgcaagcatc cttactgtgc ttcaatcagt ggcttcagtc gtggccggcg cacactgatg 240 

gagtttcttc ctcgtcgcgg gtcatatttt cctctttgca tgtctgatga cttttgatta 3 00 

gatgcaggcg ttgttcactt tccctgttga gttctgagta tatttgcatt cctattaaat 3 60 

atccctgngt tttgctctgg g 3 81 

<210> 62 
<211> 141 
<212> DNA 

<213> Homo sapiens 
<400> 62 



tcaccaaaac ccgatcatgc 
ccaagaaatg gaattttatt 
cctaatactg aaaactaata 
aatctaaatt accctatcag 
tgccagtcag ctctgtcagg 



tnggcaccct 60 
aaaagatgga 12 0 

cagtccagga 180 
caagtggaga 240 
ttgaatagaa 3 00 

328 
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gaaataaggg accctggcat ggatggagca tgtgaaacta tcaagaacag tgaaatgttt 60 
cagatttttg ctatttgcca gtttcgtttc atgaatgctg gcagaagacg cctgaatcaa 120 
agataaaggc tgtttttact c 141 

<210> 63 

<211> 581 

<212> DNA 

<213> Homo sapiens 

<400> 63 

atgtgcagcc tgtcaccaac accaggaagc tcagagacgt gccacctgga aaggaaatca 60 

gacaggagag ctcagggtcg aagtcggccc ggccgcctgg agctccaagg ggacaaatgg 120 

agcccaggtt caaccgcagc cagggaggca acgtctgtgc acctgcaact tcccatggca 180 

ttgcccaact caatggctca agaacctgcc ctgtcctgct tcgggcccag cattccatcc 240 

tctgaaagaa cacgagcgtc cccacatgct ccgtagggac catcctgcct ctgccctccc 300 

cacttcacca gaagaactcc tcctcatcct tctgggccaa cttggcagca actcctccgg 360 

gaagccttcc ttgctctccc aagacacgga caggcacccc tcgtacgtgc caatagcatt 420 

cccatcagca gttgtcacac acacaaggct catgacctcc ctccccacct gtgcccccag 480 

gggaggggct tncttggggg cagggccatt tcgtcgtcat cttccagcac cacacacact 540 

cggtttgctg aatgcttnct aaataaattc ctgccaaatg c 581 

<210> 64 

<211> 244 

<212> DNA 

<213> Homo sapiens 

<400> 64 

atgtcatgtt ggagcattgc agactgctct tctcccttct gcctttacat acaagatgcc 60 

tgttgctgag aacacttgtt cccacttctc tagcaggcaa ggatctgggc aggacaacaa 120 

ccacaggcat gtgctttctc atcatgtgat gtcatctgcc aggtcatgat gcagcaagaa 180 

ggccctcacc agatgccacc cctccagaac catgagccaa ataaatgtct gttctttata 240 

aatg 244 

<210> 65 

<211> 362 

<212> DNA 

<213> Homo sapiens 

<400> 65 

gaaactctcg aagggtcctg cctcagggtt gttttatcca ctagctgctc tagacacagt 60 

gcctgtggcc ttccagctat tcagtcaaca gcatatgaaa atgcagttca ttaaaagtaa 120 

accatccaag tcacctgttc actgtggctt cctgtcagga gggacagttt agatgacttc 180 

ttggagcctg tcaactcgta ctgcactgat ggtatcagat gcaagctggg gaatttggaa 240 

tgctatctgc aatagtgaca tctggtggct tctaagttct actgcacctc cttaaggcag 300 

gaaagcaagc ctggctttta agcagtattt gtgaaaaaat aaaggaatta catgagttct 360 

93 362 

<210> 66 

<211> 418 

<212> DNA 

<213> Homo sapiens 

<400> 66 

ggtctatgct acaccacctt ntgcttacac cgaaacaaaa gcggntggag ggagctgagc 60 

ccagagaggg atgatgcagg ctcttccaga acctgtgtcc tatgcctcaa gccttctttc 120 

cctcctgctc gctgacaact gctgaagcag aaactaagat tacgacacta ggtggcagca 180 

tnatcccacg ggaagacaac ttgagtttgg ggagaccacc ccccgccaaa ctcaacacaa 24 0 

tttggagagg ctccacgaaa aagaccagcc cccaaataac agggagactc tgcaatgctt 3 00 

ggtttccagt gatgatcaac actttaaggg ccaatggaat tcacccttac aggggaaagg 3 60 

ggaccgttga antancctgg ggnggggagg ggcatgctcg agaaacccta cctaatgc 418 

<210> 67 
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<211> 322 
<212> DNA 
<213> Homo sapiens 

<400> 67 

catggagcct agtacaaaga aaatatccaa tgaactgaat ctctactctt ctctgaaaac 60 

tcaaaagatg agtaaaggaa agtctgctat tttcagagtc cacttgtcct gagctggttt 120 

tcttctaaac cacatcacaa aagagcacga tgctgtgaac ctctcctttg gactcaagtg 180 

tactaatggg gaggaatggc aagttacatg cattatttct ggattctata aaaatgaaag 240 

tgatgggaat taaaaataag ttcattaata ttgtaattta tagttctgaa gagctttagc 300 

aaataaacta aacattccaa at 322 

<210> 68 

<211> 317 

<212> DNA 

<213> Homo sapiens 

<400> 68 

ggtgcfcttac gtcccaccca aggcaagagg aacgccagcg aggaagacaa agaggcccgg 60 

ggtggggcgc atgcccgcca ctggactgaa agctgagtca caggaatcgt acccctgcag 120 

cgggccaggc cctccaggga gggacaccgc gcccttgtgt ggagatgtcc acagtagaca 180 

aaggcagttt cgaaataaaa gaatgcctgt caccccaggt gccaccccga cccttagtta 240 

ttatgcactg gtccccaaga gcaatttctg cgctgctgtt gcaaaaattc atcgtaatga 300 

aataaacgta aaagggg 317 

<210> 69 

<211> 678 

<212> DNA 

<213> Homo sapiens 

<400> 69 

gactctgggg agctcctgca ttanatnana nntgnngata tcnactctaa nagacatnaa 60 

ggaggcacaa aagtcccatg ccgagagaga agtcggtaac tacgcctgtg accgggagag 12 0 

gccggacttg ctctccttcg cctaggtttg cactcagagc aagagagaat ataggagaga 180 

ggaagagaga aaggtaccgt cctgacaggt actttcctgg ctatcacaga aagaacaagc 240 

ctttcatggt ttattgggaa ccaagctcag gtgtccctgg aggcagagct acgtggaccc 300 

agcaggcaga agagaaaaga gccctgaacg ggaagtgtga gacctgtgtt ctattttgag 36 0 

ctttgcccca actgttaaga ggactgacca tttaacaagg gggagctggt gagatgactg 42 0 

gacactttga agtgacaccg ggacccaagg gttctcaagt tcattatttg tgaagaaatg 480 

gngcttgntt ctgtgatctt tctctgctct gaaatactac aggccttaan ctagatgccc 540 

tttggaggnc tttcctggat caacagatgg aggacttttc aaaagcagac gaaagtgaat 600 

gggatcactc acacctctgc ttcggacaca gngaagccca gatggagaag aaagaaaact 660 

tggncaaagc tatacttg 678 

<210> 70 

<211> 257 

<212> DNA 

<213> Homo sapiens 

<400> 70 

gacacaaatc caggagccat tccttctgcc tgggaggagg gagtgatgaa gaccagagga 60 

atcccagagg agaagccatc tgagatcggg aggaggagaa atggaacatc aggcggagga 120 

aacagcccag acaatcgcac tgggacgtga aaacccttgg gctgcatgcg gggagaaaac 180 

cagaattggg gatggttagg gttttggagg gaaacacagg gacatgtgac caaaaataat 240 

aataactact gttactt 257 

<210> 71 

<211> 491 

<212> DNA 

<213> Homo sapiens 

<400> 71 



CAI -201292.1 



gtaacctaat gggtttctca gccaagccgc aagcatgtaa 
atgtctttag agacttagaa aagaccagca agcttcttta 
tggcatccac ttgggaccaa tgagatggga tgttcatcct 
tctttaatgg tatccccagg agcctctgaa gtgcatcagg 
acactaagcc aaaacaaggt atgccctatt caatttcttg 
tgctgtccaa ctgatcacac tagctgaagt caaaaatgtg 
cctacttatc agattggcaa aaannaaacc aggtcataaa 
ggaaaaaaca catcttttta tatgcattgn catatataca 
atatatatac t 



ctgcaacttg aaggaggaag 
caaaatggtc tcttcaatcc 
catagatttt cacatatgta 
actttatttc aatgaagttc 
tgtcccatta cactcagctt 
caccagaaaa taaaatgagg 
accccttttg gtaaatatat 
tacatatata tgctgcatta 



60 
120 
180 
240 
300 
360 
420 
480 
491 



<210> 72 
<211> 196 
<212> DNA 

<213> Homo sapiens 
<400> 72 

ctaccagtct gaccctgact caggcctccg gaagaaacca ctcgctaatc acagtctgtc 60 
ttgcacccag acacggcatc tcagacactg cacaaattaa gaagtcaccc tcaaaacctc 120 
tatacagtgc aggaatacag ctaagacacc acacccgagt actaacatct gcaaattctg 180 
aaaagctcct cataat 196 

<210> 73 
<211> 511 
<212> DNA 

<213> Homo sapiens 
<400> 73 

aaaaacagag atctgtgttc tgaatggaaa aattcctact gatgccaccc actagtctgg 60 

aacaagtcag tctcaaacat aacaacagac actggggagc tctccaacaa aagatcacct 120 

cccaaagaac aggatggtgt cgaagactga atgccagcct gaggaaacag aaatactaca 180 

gaagcacgcc agagcctgca gtgtctcctc gctgcctctc aatgaactgc taaaagacca 240 

agaactctgc tgagagataa gaagagggga gggtgtgctg caggtggtgc tgggaggccc 300 

agaccttctc ctgacatctg gggctggcta caggaaacag aaacatcacc caggccttgg 3 60 

cgccgagaca ggacagaagc agattgtgac tnaaatcttc nggnnggaaa ggggggcctt 42 0 

tcnttttntc cttaggggnt anaacnaaag cccanaaggg ttcatccaaa ggnaaccctt 480 

aaggcagttt natgatccct ttcaaccttt t 511 

<210> 74 
<211> 499 
<212> DNA 

<213> Homo sapiens 
<400> 74 

gactttgcgt gtgaccactg cacctccagg aaggccaggt gcacatcgct tcccatgcgc 60 
ccggcctcat ggcctttggg ggttgtcgtg tggaatggag atgacacgag tgctgcatgt 120 
gaggtcagtc aggatctttg attttggagc acaagccttc tgcgtgctac tgactgggtc 180 
ctggcctccc tccttccatg gcatcgtcgt gaatgggaat ttccaccact gcctccatta 240 
gcttgaaaaa agttctccac agaagtaatg accctggact tgcagaagag agcgctaaag 300 
ctcagaaagt aaagtcagct ctcaagaaga cttcgctagt aattagcgaa gtaggatccc 360 
acccagatct gcgttctcca cctgntgnca catgaagcng gggnggtnaa aacagaccng 420 
ggaantggnt acctcattac aatgccccnc tgactggtnt aanttcccna naggggttat 480 
tggccatttt gttcaatga 499 

<210> 75 
<211> 427 
<212> DNA 

<213> Homo sapiens 
<400> 75 

gaaaaaagta tcagaatgct ttctacatga acaggaagac taaccaacgt tgaatggcag 60 
ccagtcttat ctccgtcctt atcaccacct taccatgtca tcctggcgaa gatgccatca 120 
caggagtcag ggttgaagtc caggtttaag gtgcatctag atgggttccc aggacgcctg 180 
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aagtagcctc aagaggccca aaagaaaaag 
aaggagtccc tctcatctgc ttggcagcct 
attatttccc actgaaattc aaacctaatt 
aaacatcact gnggcaagta acttgctcaa 
tggattc 



ctcctctggc acagtctcct aatggtgaca 
tacaatcaga gcgttcttac atctaaccta 
catttatttt ttattctcta taaaaatgaa 
tttctnacaa aaaaataaan aaaaggtggt 



240 
300 
360 
420 
427 



<210> 76 

<211> 286 

<212> DNA 

<213> Homo sapiens 

<400> 76 

gtggggtctt tcaatggaaa gatgctcagt tgagtgggga agagagcagg aatcagagtg 60 

tcaccatgca ncttatgcaa aatagttgtc aagctggaag gatgcaagcc caatctttgc 120 

caccacaaag gaagataata aaacccatac gggagaaaac agagccacag atggagacag 180 

tcacattcct ggtgacagtg tttgagcacc tggatccagc ccaacctgag gccattttct 240 

cctaggcttt ttagatctgt gaaccaataa atccccgctt taggag 286 

<210> 77 

<211> 279 

<212> DNA 

<213> Homo sapiens 

<400> 77 

cttcatctct ccccgttaca gaccaggaat tccaaattcc tagcccaagg tcagagaggt 60 

ctcactgatg cctgtgtagc cacgtgagga tgggaagtct catttgccag taagcactac 120 

aggaagtgat ggttgaacac gatgggacta ataagaagga aacgtagtta gagtgatctt 180 

attcatttaa aaacaaaagc agcaacaata cagcagtcga ggaaaagaat caattctatt 240 

taagcaaagc aatttaaagt aataaaaaat gtttccagc 279 

<210> 78 

<211> 481 

<212> DNA 

<213> Homo sapiens 

<400> 78 

ctgctggttg gtttgaagag aagtttagtg ctctcaacag caatgaacag cattgggtca 60 

atattcagtg gccgggagac aatctgggtt actacgtatg ctgctttgtt gtgaactgga 120 

attggcatca tgtctccaac attctgaagc caaggctgag gatatacaag gtctggaatc 180 

attaagggtg tgataaagtg ctgagaaaca caggagaatg cattgttcag tgaatgaaaa 240 

ttgaaaagag agatggagac agacaaagaa aaagactgaa caactgaata gccaattttt 3 00 

tttaactctc aggatgtttt ctcctacctg gatggacaca attttctgtg gnggtacatg 3 60 

ataagtattg gctggggtga ccattccatt tnctggnccg cccaaggana ttttgnaang 420 

taacanaaaa gggccatnat attttccttc tctaacctgc cttggancaa gccctaaaat 480 



<210> 79 

<211> 200 

<212> DNA 

<213> Homo sapiens 

<400> 79 

agagctcaca gcatcctgtc tcctccagaa gctcttcccc agctgaaatg gaagtgaaag 60 

actggtagtc tcctcctcaa ccacccacct cctggggccc tgactgtgtg gatgaactcc 120 

tcacacccag gatttgtgtc tccagtgaaa agcagcaatt tatcctacac tgaaaatttc 180 

ctgaataaaa acagttcacg 200 

<210> 80 
<211> 239 
<212> DNA 

<213> Homo sapiens 



g 



481 
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<400> 80 



caggagcatg caacacctct tggactcgat gaaagctgtc 
agtactctga aagagcatct tgggggaaaa aaaagcgtgt 
cagaaagtga agtctcgcaa aggaaaaaga caagactaaa 
tgggcttttt cttccactca gcatctcttc ccttattaaa 



gccacaggtt 
cagacattca 
gggaataaac 
atgagaggga 



tcaaccagtc 
tcttcataac 



catcgttgtg 
taacttaag 



60 
120 
180 
239 



<210> 81 
<211> 495 
<212> DNA 

<213> Homo sapiens 
<400> 81 

cccttcccgt cctcccgctc cccagcaagt cagaagcaga aggcttggtt gctgccagcc 60 
aggcaaggga cagcctccag cagagtccac ccacccacag ttgtctcctt aggacaaaca 120 
gaaagtttca caagcacact ttgttcagtt ctgcagctta ccaggaacac tagaaagcac 180 
tccagcactg tgcctggggg ccatttgaaa cagcaaaatc atcaacaaaa accacaaaaa 240 
tgcaaaaacc atggcactaa atagaccatg aaaaggacac ctgtttactg catgacctga 300 
aacaagaagg cggagcgttg ccttgttcga cttcagctgg gaagataggc gtcaggggac 360 
tcaaactttt cagcactctg ttatatctgn gaatgatcac aaaaaaactg gggagtntta 420 
tttttggggg ttacnaataa atttttacca agtaagcttg nttcacaaat acanaattnt 480 
ggggataatg aaaat 495 

<210> 82 

<211> 98 

<212> DNA 

<213> Homo sapiens 



<210> 83 

<211> 486 

<212> DNA 

<213> Homo sapiens 

<400> 83 

cgtccacagg atgtcgggcc aggagagctg aaagccaata ctgatgagga agggccaagt 60 

gaggaagagt ctgagctgca tatgtcaaga aggagaaagg ggaaagaagc aaggagcgag 120 

accagaggga gccacgcaga aacctctggc ctctctgcac gtctgtctta tcctacagag 180 

tggcgactct aaaaggccaa gggtgccagc gcccagcgac agttcacagc ctgagacacg 240 

ctttgctcac acgcctccct cctcctctgg ctcctacctg ataaaaagca ttaccggttt 300 

tgatgtttcc aacctccccc attttccctg gtgaaagatc cattcatttc agtgctaaca 360 

agacatcata agcagggaga aggaacaaaa ggcanantgt gtncttaagg agggaggcan 420 

tttgcaaaag cnccactntt ttcaccttgt ccacagaata aagggttgaa gactaaaaaa 480 

aaaatt 486 

<210> 84 

<211> 280 

<212> DNA 

<213> Homo sapiens 

<400> 84 

ggtctgcacc tggagactcc cacctaagat gggggtttag atganaccac tntgggagga 60 

cacncantcg agtgtggagg ccccgaggaa gatcanctnt naanacacag gcaggcaaag 120 

ggcagacctc taaggagatg gangangaat gacanagggc nngaagaatc ntgtgaggga 180 

ctgncanana agccagtgac naaaacttnc agaagagctg ncaacagtac caaacaaagc 240 

agaagagtct caaaagatta aaaataaaat ttgcttccat 2 80 



<400> 82 

gtaacangaa tgaagaaact acaaagaata ttgagaagga 
aaaaccagga aaagatggct catggaagca aagaaaac 



agcatcacag 



aagtgagagg 



60 
98 



<210> 85 
<211> 408 
<212> DNA 
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<213> Homo sapiens 



<400> 85 

atgaggagac ccaagttccc agaagagcag ttgcacactc gaggctggag gacatgggca 60 

gaaccagagc tccttgcctc cctcccagcc ccccacccaa gtaacacgtt cctgatcctg 12 0 

tcctggaagc agcttcgagg aaatgcccag acccctgggg ggtgatgtgg tggcaaggtg 180 

acaaaggggc aggtcacaac gctgtcacaa gctgatatgc aagaactcac aggcatgacc 240 

cccaggggct atgggtgtaa gggcatctgc tctgcccttt ccagcgggcc tagttttggt 300 

ggcctctgtt ccatttattt gcttaggaac acaaagctga atgcactgtt tgcaggaagt 360 

tgtgtgtcta agtcacctaa gttagtaaaa taaataaaaa ccttttgg 408 

<210> 86 

<211> 477 

<212> DNA 

<213> Homo sapiens 



<400> 86 

acatgctgct cccaaacagt gcctttgaat caagacccag tcatcgtatt cgaagaaaaa 60 

ggaaatatcc ctgaccatgt tgggacttaa cactgcttca cagagctacc caaaccaagg 120 

agaataccaa cgtgaattgt ctttccacct gttgtgtggg gccagcaatt attcttttag 180 

cttgacgcgt taacccacct gctccctgtg gccctgggat gctctgccat cccccgtggc 240 

tgccagttca cttagggtag acttatggca gagggatgtc aattttgctt gaactgctca 300 

atcactgctg acatttcgtt aaccacccta tgaacttctc aagcctgaag tagcagcaac 360 

ttgtgccctt gaaaactgaa cagaaaacaa ctggattgna ttttttcttt caccaggaaa 420 

aaagacaatt ttttntttgt tganaangtc ataaaggcat tttacccact tattttt 477 



<210> 87 
<211> 500 
<212> DNA 
<213> Homo 



sapiens 



<400> 87 

cttctcttat tcctgactct ggctgccatc gttggctgat gaaagagttc cttttatttg 60 

gtgagttcat ccatcaagat tgtcttcgaa gctttgtctt tgaagttttc acctattccc 120 

aaccactccc cctggaagct tgtttcctgc actgttaaga gcatggaccc tgaaggcgga 180 

ctacctggat tcaaacccta cctccacctc ttattgggag aatgaccttg tgtaaatgac 24 0 

atcacttctg tgtctcagtt aacacgcctg taaaatggaa ataatatcta tttgtgatgg 300 

ttaagtttta tgtgccaact tgactgagtc agagaatacc gagacagcag gtaaaacatt 360 

atttctgagt gtctatgaag ggtgnatctg gaaaaaanta cntttggaat ccgtngaaaa 42 0 

ggggcaagna anatctgggg cggntcatct gggnatcatc caatccactg gagggctcac 480 

ccaaatagaa caaaaaggct 500 

<210> 88 

<211> 381 

<212> DNA 

<213> Homo sapiens 



<400> 88 

gacactggag aggggtaagc atgctaagaa gtgagatgga tttaaccagc aactcacggc 60 

aaagtgcgta tagctgcgtt tgagaaggct tagtcatgac tagaaaagtg tgaatactgt 120 

gacatatcct tgcaaaaaaa tgttcagctt aagcctctan actaacttct ggtttacaag 180 

aanaaaaaag aggggcccat ttccaaaaag actcctgcct tgaactcttc aaaatgccna 240 

tgncacaggg ggaaaaaaga tgggggaact ctactacntt aaagctaaag aaaaatttna 300 

aaaaaaaaan gaaaaaaagg gccngcgngg ccnattnagc ttggacttan ccaggctgaa 360 

cttgntnaaa agggggggga c 381 

<210> 89 

<211> 458 

<212> DNA 

<213> Homo sapiens 



<400> 89 
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gtcacaactt ccatagtcag atcctggaag 
ataaccttcg gagaagagag atgctctcgg 
cacttatgtc gagactgcaa aggctaacag 
ttcgctgcaa aacaaacgaa aaaacaaagt 
tcaacacact acccagcttt ggagccaagc 
agatactgcc acctgctcca agtgtcaaat 
ttaattcagc catagacagt caatctggga 
tttataaaac aaaaactgca tgaattatca 



cccacttcaa gcacagcata ttattaacaa 
tgccagtggg ggaagaaagg actatactta 
catcttcatc ttgggtgctc tgtttccgct 
tcaaaggcat gcagccctct ccagtccaat 
ctcatgagtt cccccaaccc agttcctgcc 
ccagaagaca aatggcctcc aatggtcttt 
tagaatgatc tccttaagga acccacatgt 
aaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
458 



<210> 90 
<211> 227 
<212> DNA 

<213> Homo sapiens 
<400> 90 

gactctgggg agctcctgca ttaagntana nctgatgact ccagngaccc ttcatgagaa 60 
gaacatgtct gcggtagcca ctggtccaag gagaatgagg aaatatgtag agcagctttg 120 
aacctaatca gcagtctgaa gtcaagccca gtggattcca gccaagcaca gcagaaccac 180 
agccaatcta tagaactatg agagaggaaa taaatatttg tggctat 227 

<210> 91 
<211> 256 
<212> DNA 

<213> Homo sapiens 
<400> 91 

gcctctattt accatcccca ggttggaagc aaatgtcaga gagaccagag gaaaccgtgt 60 
gtgttttagt gggtttattt ggaggggcat gggctggaaa ggagcgggca gagatgcagg 120 
gcaaatctat aaaacatttt gaacttgtgg cctataaacc accaaacatc atgcaggtca 180 
ctgatgtgag gatctgctgg gcttatggca tttgtgacaa acccaatgat tcttttatta 240 
caacagctta taaatg 256 

<210> 92 
<211> 305 
<212> DNA 

<213> Homo sapiens 
<400> 92 

gattgggacc agctcatctg aaaattgatt gccggacatg gagaacaaac tggttcagtg 60 
ttaacgagga ggaacggatt tgtccatctg accacaaccc aaattgcttg aaaatttggg 120 
cagctgtgtt aacagggaaa gaagttggga catggagttg gacagacctg gctttgagac 180 
tctgcctcat cacgacctcg ctgtgtgttc cctctgaact tagctttcta tattaacaaa 240 
atgaggccaa taataattcc accctgtctg cattccaggg caattaaaga atcataaatt 3 00 

ggcct 305 

<210> 93 
<211> 190 
<212> DNA 

<213> Homo sapiens 
<400> 93 

gtgaagaaat gagccataag agaangactt gcccaagatc acacagcatg gcagagcccg 60 
ggacatgaaa ctaagcattc tggctccaga gtccacgttt ttaactcaac cggaatactc 120 
agcaatggct gagtctacgc cctgtcgtcc cctcctgggt ctcacagaat ggaaataaat 180 
gtctcaactc 190 

<210> 94 

<211> 509 

<212> DNA 

<213> Homo sapiens 

<400> 94 
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ctttgagcct tagctgtcat taccaggcaa aaggaagagc cccactcagc acccgtttcc 60 

ggttttacgg cccaggcact gttgagcaga ccactatgtg gaaagccagg gaggataata 12 0 

gcagcccccc aatgaggcca cgagccccag aaccatcctg attgctccct ctgaggtgat 180 

ggacagagga aattttccct ccaaggactg acagagaaag aacaacggag atgtggtcgt 240 

ctgctggcat ccattaactt gtgcaactag caaagcaccg agtccacagg gaaaagggag 3 00 

agaaagtgta aatgaaggtg caattgtgtg tggaaggctg agtgtggtca caggaaaatt 3 60 

gcctcatnct tgtattgnaa tggcatcttt tattncctca accccaaggt tntaaagtan 420 

gttccctntt ccttttcnta agccaagcac ccttatgcca ccatcatntn tnacttanac 480 

cacaacttta tcctnctgac atgtttacc 509 

<210> 95 

<211> 419 

<212> DNA 

<213> Homo sapiens 

<400> 95 

ttgtgataat aaaggctcag agaaatcaag ttttaagccc taagtcctgc agtgaatgag 60 

cagcagagct gcagctcgtc tcagtcctgt ggatcacacc atggcctgga aggaaaagtt 120 

tagggcaata taacccccta caaacaacct tccgacaaga ggacaagtgt tttcacaagg 180 

cttcatggaa tgtcgaagtg gaggaacaaa acacttcagc tggaaagata gcacatagcc 240 

agaagtcaac cccaacccta ccaaaaataa tgatgccagg aaacagagct acatacacaa 300 

aagggaatgt gtaccaggat acacataata aagtcccctg gccaaagctg ggattcctcc 3 60 

tggccaagcc agaggagtga ttcaacttaa gagaaaattg gaaggaggac atgtggaat 419 

<210> 96 

<211> 95 

<212> DNA 

<213> Homo sapiens 

<400> 96 

gctggaagga tgacctcgga agtcacatgc tgaagatgga agacatgttg tagtgctgca 60 
ttgacctggg gctcagacat ctcagactct tgtag 95 

<210> 97 

<211> 505 

<212> DNA 

<213> Homo sapiens 

<400> 97 

gacctaacaa agggaatgga gagtaatcac atcattccaa gacccttcct ttgcagtcct 60 

gtagtcacag ctccaaagac tctgggtttt ggagtaagag ctgtaactgc tcaagaagaa 120 

ttcgtgaaca aaagcacatc tctctgagga ggcaaaatat cacaggccta tgacaccaga 180 

ctgctggaag aggcactaga ggttgacaat agattccaac atctcataaa ccaggaagca 240 

gcctcaggaa ggttggcagc tgccaaaccc acaggctaag cagtggtggg actgtgattc 3 00 

aaactcagat attttggttc atctgccagg aaatttttcc tgtcctggaa ttatctgctc 3 60 

ttctcaagaa ggaaaaactt aatccttctt antcctgaaa cccatcttag gaaaggcaag 420 

aaggaaatgc nccaaaatgt taactgnggt tgacactgaa gggggaattn gggctttgtc 480 

tattttttct gcattgaccc atttg 505 

<210> 98 

<211> 500 

<212> DNA 

<213> Homo sapiens 

<400> 98 

gagaaaaaac atatgaacct gagcactgaa tgacttatca agaagatatt tgaaactacc 60 

taaacaagga agtttgtgtt ccaaggtaag agaacctgaa atgaaaaact caggatccct 120 

cacgaacagc ctgaccctgc tttcaaccag gaagttcaag ggaggcagga ctttacggtc 180 

aaaactgcaa agccgaagct caagactgta agaagaaagt gatcttcaaa gaaaaggatt 240 

cacccaaatc gaagaggata tcgtttcgca tcagggacac tcgtctccac acctcctacc 3 00 

tcaaagtcct acgcacctac ccttcacgtc tctncaaagc aactgaatta aagcgcctac 3 60 

tgggcttggc ggngcaagga atttaattca ggaactatng gggaaaaaag caggggagga 420 
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agaaanagga aagacccggg ctgaggcacc aggaagaagg gacgcacaag aacctatcat 480 
tggagcttgt tcgaggccag 500 

<210> 99 
<211> 482 
<212> DNA 

<213> Homo sapiens 
<400> 99 

cttcctgcaa ctgaaggtca ttcctctttg ttagaagact aagggtccct gacctgatct 60 
gtggagcacc agggtggaga gagtggaata agcagcaaaa cgaaaaattg gatgctgttt 120 
tcaaaagttt tgttctcatt cttggattat agattatcta aagggaaaat ttaactcaac 180 
caaaaaattc gttcagctcc atgaagctaa agatgctata aactgactct ttcctaaaga 240 
gcaccaaacc tgaaattttt cctgctagag aggaactaat cttcaaggac acctgtctat 300 
tgctagacat taagaaggaa ggtgaactcc gttctgtctt cataaaacac atttttgnct 3 60 

tttcccctta cttcttcact gaaccccttt tgtttacaaa gtccaagctn tgactggngg 420 
aggggggaaa atctgaaact gtcagcccca aggnggaaca aaatgaaang gagaaaaaaa 480 
at 482 

<210> 100 
<211> 508 
<212> DNA 
<213> Homo sapiens 

<400> 100 

cctcatgtca ctagaagcta cagtattgga cagcacaagc tgcagagtgt ctgttctttg 60 

aggattctct gttctccaaa tgtaaaatca agaatgagaa cgctggcaga agtaaggaaa 120 

gatgagacct gttttgaaaa cgaagtttta gaggaactat gtgaacagat tgtgttcttc 180 

aggggcctgg cacatgatga catctaacac ccacggccaa cagcattcat aatcaccaat 240 

acgcagcatc atactctgtc tactggcaat tcccagagat ccaagaaata tgtaaaacac 3 00 

tggctagaaa gtgttcttgt ggcacgaggc ggtgctcatc aagtggcttt agggtgcact 360 

ggtcacctgt tacattccag gcttctggag gacctgagtc cttgccccac ttnancccac 420 

accacctttt gtcacccttg agacttataa ccaggccagg cgcgatggct catgcctata 480 

atctcagcac gatgggaggc cgaggcaa 508 

<210> 101 
<211> 376 
<212> DNA 
<213> Homo sapiens 

<400> 101 

caaatgtact ctatcgtctt ccacactggg accccagaca ctcatggagg aggaaattct 60 

tgaccaaaaa tatgtgttac agaacctgag agagaagaaa aatttcagga agacgatgac 120 

agtcaataag atgaaatgat gaagtaaatg taaacatgat acagactgag gccattggct 180 

ctgaatatcg agacatcact ggaatgtttt gagaaattaa ctttgattgc gaagagatta 240 

agaattagaa tgcagtagga aaatgaatta acatctgata agaaaagaaa ccaaagagtn 300 

aagacctgta gttctgcaac acagatgctc atcagaaaaa tgtgggtaac cttttcaata 360 

ataaaaccct ggaccc 37 6 

<210> 102 
<211> 304 
<212> DNA 
<213> Homo sapiens 

<400> 102 

atgtctgatg tccnagtagg agtgattatg gttactgtgt gaagacttga ctctcaagga 60 

gttgcaggat catacgtggg aagtggaggg gttcccatgt gaccttctat gaagatcaga 120 

agaatagaaa acctgaagaa tacattttgg ttggaagaat agaaagtctg cctagagngt 180 

ctttggaatg ccagaggatg agatccgtct tgtttactaa gagttgtnac ggntcccctc 240 

accttacctc ccaaatcctg gtnaggaacc aggacctgcc aaggtgaagc actgatacat 300 

tttg 304 
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<210> 103 
<211> 501 
<212> DNA 

<213> Homo sapiens 
<400> 103 

gaatcccatg tgcatgangc ccctacctcc ctggaccaca ccancatgag atgtcttcct 60 
gtggcaatga gggtcacgag tcttgcctga ttttctatgg ttccagaatc acccaagcgg 120 
ataatgaagt gagntgcagn taanatggag cccactgggg aagagatgaa gcagtgttca 180 
cctgaagcac catctgcatt ttcctagtcc tgacagttac ctctanctga ccagggtttc 240 
tgtgcangac ttctggtatc aaatcaacga tcaagggtgg tnacacataa agatgaacag 300 
ttccatacgc aggtttaaaa aagaangcct atgaagaaat ggtaatactt aaaagcactc 360 
ttgaagntaa ngggatatgg cgntangaaa acctttaaga tccttttant aggnnagaaa 420 
atggtctcct cantaaaaac aaggccgtan gntttntttg ggctttcgcc aaatgcaacc 480 
tgccntnccg gccggtgcca a 501 

<210> 104 
<211> 431 
<212> DNA 

<213> Homo sapiens 
<400> 104 

caaaacngan gacccagcct tgtgtgcana ngccgctgaa cnnnngaaag cccgaannga 60 
ancananagg ggctcangac gctgtgagac ttttccattt cctttgcctc ccagcaggcc 120 
gngaaagagt cacttttcct tgaggaagaa agaaggctct gtgtgcaggg caagggtaca 180 
gtccttctaa ccaaaagatg tgtgtgctgc atgggatgtg gccaccgaca ttcatttnnc 240 
ttttactggg acttaacgaa ttccatctct cagtagccat atgccagggt cccaccctgt 300 
ttcctctggc tctggagggn ggagaggaag gacttgcttt acccaagggt ctataaggaa 360 
tcttgggaaa gacactgccc cttaaatcac tttttgggca ctggtgtcac ctttgtgtca 420 
cttgtgtccc t 431 

<210> 105 
<211> 414 
<212> DNA 
<213> Homo sapiens 

<400> 105 

gacccagctt gtgtgcacan ncnncnngan gacaattgca tcactggctt ctaccacttt 60 

gacaacaggc agcaccaaaa gcagggncng gaggactaag gacaactgtg ttgaaactga 120 

gtcaacagct ctgtttgagt aaatgatcca tccttgaatc gtgtatgcag agacaagatc 180 

agcagttgga ttgttgtttt aataaactgg aagtctgcca acattatctg ggaagaggac 240 

gaggacatta atgctagcat gcaatctagc cgtgtttgga tttaagacag aatttaatct 300 

tcttgcctcc tttcctttcc ctcctcccct tttcagncct tttttcctta atacacaagt 360 

ctcttttatg gagttaactc aagctatctt aaacagcatg aactaataaa ggca 414 

<210> 106 
<211> 435 
<212> DNA 
<213> Homo sapiens 

<400> 106 

tcatgcagac acctgatgga agangtcttc caggcagaag gaaggacaaa tacccttgat 60 

atacatgtac ttggccggca tgaggaagag caatgtggaa gcctactcaa tgtgaagaca 120 

aggatgaaga cctttatgat gatccatttc catttggtga atgcctcttt caaaagaaga 180 

cgtaagacat ctggtgtcaa gaagaataaa tacaatacca ttaaagaatt ataaacagaa 240 

ccagagccag agaagaatac catttttact tgacagatga ctgacacaaa acttggttac 300 

acagacgaag tatttaagca agatactttc tcgaaaatga acaacacgcc gactgncatt 360 

tcaaggaaac caactgacaa catttcctgt taggacaaaa tacaagtttt caaccaaatn 420 

ttagaattta ggaca 435 

<210> 107 
<211> 437 



<212> DNA 

<213> Homo sapiens 



<400> 107 

ggaattctaa aagtccaaac tccatctttg gacgccaaac cggactgagc agaagaatct 60 

tctggtatgt gaactagggt cctggttctg gttatcagct ctcctccacc taaataagac 120 

ctgattccca ggcaccacat gctgatgtgg tcaggaatga gatggcacct acctctgcag 180 

cttggcagct cctcgaatgg agacattggg tcttattcac ctctgggtct ttagcaccca 240 

gcacaaaggt cagacagggt ccagacgcag ttgtgcccac ttttcgaggc tagaaaataa 3 00 

tgatctaagg aaaagacgat tttgaggnct tcagaaaggg aatacagcag caaaagccag 360 

ggagcctggt taacttcttt gagcacttgg aaggataaan aaatccatac cctggaaaat 420 

ggnggtttgc ttaaatg 437 

<210> 108 
<211> 383 
<212> DNA 

<213> Homo sapiens 
<400> 108 

ctggggagct cctgcattaa gnnataactt ganggaagac aaccaccatg tcctgaggcc 60 
actcaggcag cctacgaaga ggccacatag agaagaacag agggctgcag tctacagcta 120 
gcaaggaacc acagcctgcc aacaaccata agagcctgcg tgggagggga ccttccagcc 180 
cccattgaca gcctgagtgc aactccatga gagacgctga ggagaatcaa gtagctaagc 240 
ccttcctcaa ttcctgactc tcacaaactg tgcaagataa taaagattcn ctcttttcag 300 
ctgcaaaaaa aaaaagggnc nggggggccn tttnngtngg ncttnancng ggggaanttn 360 
ttnnaaaggg gggggccccc ccc 383 

<210> 109 
<211> 79 
<212> DNA 

<213> Homo sapiens 
<400> 109 

gactttgctt ctgggaagat ggagtacttt tccttattct ttccacaaac gacaactaaa 60 
atccctaggc attatatat 79 

<210> 110 
<211> 473 
<212> DNA 
<213> Homo sapiens 

<400> 110 

ttctgtnacc tcaagcggca tccctgggcc ctggtctcca agtcccgatc ctgtctgaaa 60 

aatggcgctg aaggcctagc acanggcagc ctctacctca aagcaccatc ccgcttaaca 120 

ttccaacggn gcctnaaang aaaaaccctn tggtggggtc caccaaaaac ccctggcctc 180 

catgtgctcc ttcctggccc caaggacagc ttgacactnt ccaggaagna aaggccaang 240 

ggnaaccccc tttgcaanaa nacttatttc ttaaaaaaga tctnggnttn tanantcaan 300 

ggggacctgg gtttnaaagt ccccggcatt ttgcccttct tgaacttcac canttgtttc 360 

aacncntttt ngggccactt ccacctttnc cccttcatnc tngggaaacc ctccangttt 420 

ttncctccat tctggggnaa gtccaagggg ggnggggngg ggaccccacc ctt 473 

<210> 111 
<211> 417 
<212> DNA 
<213> Homo sapiens 

<400> 111 

ttctgtcacc tcaagcggca tccctgggcc ctggtctcca agtcccgatc ctgtctgaaa 60 

aatggcctga aggcctagca cagggcagcc tctacctcaa agcaccatcc ggcttaacat 120 

cccagcggtg cctcagatga gaagccctgt ggtggggtcc accagaaacc cctggcctcc 180 

atgtctcctt cctggcccca aggacagctg acactgtcca ggaggaaagg gcaaagggga 240 

agcacgtggc aagacactca tttctcagaa agtctgggtt aggagtcagg ggacctgggt 300 
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tcaagtcccg catctgcctc tgactcacaa gtgncacctt tgggcactta ctttcccttc 
gctggacctc agtttcctca tctgggagtc aaggggggtg gaccagctga tctccgg 

<210> 112 
<211> 262 
<212> DNA 
<213> Homo sapiens 

<400> 112 
agatggggtt ccatcatgat gcccagactg 
ccaccttggc tgaaatggcc tgacatgatc 
agaacatgaa tggatgactg tttccttagc 
gaaaccaaaa gaaaagttcc catccatttt 
gtggcttaaa attataatgt tt 

<210> 113 
<211> 229 
<212> DNA 
<213> Homo sapiens 

<400> 113 

gctcaaccaa atgcctctgc caggagaatc tttcagagtg tcctggaaca ttggaaatag 
gcttaaagct taaatgatga atcagaagag ttatgctgta ttctaatgct gccactaggg 
ccacacaggg tgccaacatc caatctcaag atcttcggga aatatgctca ccctccaaaa 
tacttacaga tgtgtctcct cttttttgta aaataaatgc tcttcttat 

<210> 114 
<211> 318 
<212> DNA 
<213> Homo sapiens 

<400> 114 
gtgctgcaat caagagaaag agacagagcc 
gaaatatgag aggctcaagg gcagggctgt 
ctggctgcat ccaaactgca atgaacctat 
ctgctgagcc tgtatgaagc tactatccca 
aattcaagtc gacactgaac tagtggtaga 
catcccgaca gttctgag 

<210> 115 
<211> 426 
<212> DNA 

<213> Homo sapiens 
<400> 115 

atgcacagan aatttctgac cttgngacgt ttgggagtga ggagatccca tacagaggca 
tccangnatt tccagagatc ctgtggcngg tgaggnctgc cctcnctgga nccaactcgt 
ctataatatc ttcctaacag cangagtcgc ctgcggggag gagaggagaa gacagactaa 
gctgcgcgta gagcggcatc aggagcaagt taccgttagc atgtgtaaac aaaacaactc 
gactcctctg tgtcagaatc aacaacatca aagctgataa tgtggctggt tgggatcaat 
tagcactgga ttttgcccca agattgcttc ccaaggcgga caagtgggag ccacttcatt 
ttccagcgac ttttacttcg ntcacgggca tatccacgcc agggctgcag aagcatttca 
aaaggg 

<210> 116 
<211> 229 
<212> DNA 
<213> Homo sapiens 

<400> 116 

tgacacacgg agaggaaaca tcagattgct ttttatccgc atctataagc ccgggtcata 
actggagaaa aagccaccat caacccagaa ggccaacttc cataattata tgaatcgttt 



gtcttgaact cctgagctca agctatccac 
agcactgggc gtgacccaaa gatggaatga 
aacaagaacc atatgtttcc tttgaaacaa 
tctttccacc aattcaaaga ctaaatagta 



acactgacaa gaccacgttc tagagagaag 
gaggacaagc aggggagatg agatgaggag 
accatagaac acagaacaca aacattgaac 
ggactgtgaa aagtagacta gttgaggaag 
gctctcatca tacagatcgt tggaaagtag 
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gtgaacattt atggattaaa atgtttgagt aaagctgaaa tcggatatta cagtccatga 
atagttcatg ccatgagaca aaaaattaaa gaaaaaaatt tcattgatt 



180 
229 



<210> 117 
<211> 430 
<212> DNA 

<213> Homo sapiens 
<400> 117 

catgaactga ggtgttccat gggtggtcag ccgatctcca cccccaaggt tgccttccca 60 
gagcctcaga cccatgcccc agcgttatgg agatgtcttc tggaagaacc ttaatcaaag 120 
gcccaccccc acttggctgg aggagcagca cattccaccc atgctgagag ccactggttg 180 
ctcccagctt ggtctgtatc ctcctgagca gctcccaccc cctgaaatgc tttggagaag 240 
aaagaagagg aggccatgtt tggaaggaat gcagcagcag ggccttgggg gagtccccgc 300 
ccgggtgagg gctgtcactt accacctgga ggacctaaaa aaggcgtcag aagcattatt 360 
aaacgaactt gaaaaaggcc cagtggggca agcttntggg gctggcatct tganccagtg 420 
ggtgcttggc 430 

<210> 118 
<211> 435 
<212> DNA 
<213> Homo sapiens 

<400> 118 

cnaanctnna aagggcncnt nccaggttaa aacccncann 
aaggctgncc ttnggctcca tcaacactct gctagccaac 
ctgctatcca cagctctggg gcacttctct ggctgtctgt 
caacctcatt ggccaggtaa aagctatcga aaataaactg 
nccatgaggn ttaatacagg aaaagctgat agtcaaaagt 
tctccacagt gaaaaaatgn ctttangctg gaataccaaa 
ggacctgnct tcaaaagatt ttnatcttcc cttttccctt 
aattcccagg gttca 

<210> 119 
<211> 405 
<212> DNA 

<213> Homo sapiens 

<400> 119 

aaatggggaa gattgaagca aaaaatggaa 

ggttcccctg ctactcttgt gaagtttcca 

tcagggttcc aatcttttct cccttaaaaa 

tttccattca acaaaaccgn tgggcttacc 

gtggactctg aaacttgctt tttaaaaaaa 

agcaagttta gacatctgca ggtctgatgc 

cacacacagg gccaggcacc ttttcttctt 

<210> 120 
<211> 424 
<212> DNA 
<213> Homo sapiens 

<400> 120 
gcgctgaccc acgaatgcaa ctctcagccg 
agaagggctg ggagaacatc aaggcttggg 
agaaggtgaa agggagcgtc anccccctcc 
ccancatgcc gctggagcag ctctacttcc 
gaggaggagc ccaaatacca cgccacggcc 
ctgggcgatg atgaaaaaag aagttcatac 
taaaccctga accctcgtgt tcaagcttgt 
agcc 



cccaaaaaaa atngggttaa 60 

actttggccg caagttcact 120 

tagtaaccac taacctaacc 180 

aaaattgcta tctctatatg 240 

caagntcaaa tggcatttgg 3 00 

gaactnggga ggcaacaccc 360 

ggntggcagg gcctaaaatc 420 
435 



cacgttaagg ctatttatga 
ggtaccaaaa gcaaacttcc 
tacaagatcc agaagaggag 
aaccttacac tggaaacaac 
gcgtttcaag cgataagtgt 
agtcatcttc tggggggttt 
tagcagcaga agaaa 



agtaagaaat 60 

tcctaacgac 120 

ccctgtcaga 180 

aagctcaaaa 240 

aacgtgctac 300 

acccaacaga 360 
405 



agctgtccct gccggatttc 
ctaaaacaat tatggcccat 
tgagtaacca agtcctaggg 
tgcagagcac tccttccacc 
caagaatcat ttgctgtttc 
cgtttttcca acaccgtgaa 
naagaataac agccaataaa 



aaacagctga 60 

gaaaggagag 120 

aaggagatca 180 

cctccccccg 240 

aaatagagaa 3 00 

aaggacctnt 360 

aactacattg 420 
424 
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<210> 121 
<211> 422 
<212> DNA 

<213> Homo sapiens 

<400> 121 
nnnaactgaa ataangaagg atnggtcaga 
cactccagag ccctgcccca tctagggcgc 
ccttggtcca accacagtcc aggaaagcag 
aaaatggaag tacaatggag tgctgtacct 
actcacagac tccctagagg agaactccac 
ttctgagcat gccaggtgag gcctctccct 
aaaanaaagc aagcagcccg cgtgaccaga 
ga 

<210> 122 
<211> 409 
<212> DNA 
<213> Homo sapiens 

<400> 122 
gcttantagg tattccattg ngcntacaga 
ggctgnanct tggctcttga gaataangca 
tgacattaca ggaggagcag agttctatca 
cttatcatca tctcatcttg tgccggtata 
taaggaatat ctctccatgg agacagagcc 
gttcctgttt tcaagtaatg atttggataa 
tctggcaagc tgaaattaat tctccaaaga 

<210> 123 
<211> 419 
<212> DNA 

<213> Homo sapiens 
<400> 123 

gcgctgggga gctcctgctt taagtnanan cngaaatcac ccangtcann aagganaang 60 
aaaatanaag ggcaanctcg ctgtaaagaa nggattactc aaangtngaa ccaaagccgg 120 
gggaaagaac atggaaagca gtggagaggc accaggcagg tcgctttctc tttctggtcc 180 
tcaaccacag cactgccgtc ttcagaacag taactattac ttgtccatac caggcatctt 240 
caatactcct caactcatat caagaattct gcccagtcta aacagacctc catcctacaa 3 00 

acactgaaac cctaacccaa aaccttacat atatccacct ctcacttatc ccttctgaga 3 60 

cantatgaaa aacaaagngg cagtttccct tactggaata agtattaaat tttgcttgg 419 

<210> 124 
<211> 410 
<212> DNA 
<213> Homo sapiens 

<400> 124 

gagccgcaaa gacagcctgg aaagtgacag ctccacggcc atcattcccc atgagctgat 60 

tcgcacgcgg cagcttgaga gcgtacatct gaaattcaac caggagtccg gagccctcat 120 

tcctctctgc ctaaggggca ggctcctgca tggacggcac tttacatata aaagtatcac 180 

aggtgacatg gccattaccg tttgtctcca cgggagtgga aggcgccttt gccactgagg 240 

agcatcctta cgcggctcat ggaccctggt tacaaattct gttgaccgaa gagtttgtag 3 00 

agaaaatgtt ggaggattta gaaagatttg acttcttcca gangaattca aacttcccaa 360 

agagtacagc tggcctgaaa agaagctgaa ggtctccatc ctgcctgacg 410 

<210> 125 
<211> 358 
<212> DNA 
<213> Homo sapiens 



nanacagcca acggtgtggc 
acgtgcatgc ctctgaattt 
attttctatg ccccgtggca 
acccaagcac caggaggcag 
gcacccaaac tctgctgtgc 
ctctntntnc cttcattcca 
cagagccttc cttgctaata 



caacaatcac 60 

cctccccttt 120 

atcacagtgg 180 

gagtcgagct 240 

cccctctgag 300 

agtttttngg 3 60 

aacccatcct 420 



cctcatttnt tactccattc 
ccaangaaca tgggagngca 
tgtagaaggt cattcacccg 
caagtaagat cagccagctg 
agacggccca agtctcttct 
actgggagaa ccagtttcct 
ctcctctttg gaggcaagc 



atnngntgat 60 

gcaaagctca 120 

agcatgcttc 180 

ctgaaatctc 240 

ctgttcttga 300 

ttcctccaac 360 
409 
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<400> 125 

crmanactga gagataggan ctcgctacgg ttgcctggtc tcaaactcct gggcccaagc 60 

catcttccag catttgcctc ccaaagttct gggattacag ggcctgcaca ccaatgaaac 120 

tactgatatc agctgttctg aagaaaccca gaagagactg aatcaccaaa gagtgcagtt 180 

tccacatcct gatgatttta tcatccttac tctgaccaaa cagtgacctc aattttacag 240 

cccctcacac cctataatca tcctaaaaac ttcagcccag aactcctcag gaggataatt 300 

tgagggtttc tcccatttcc ttatttggct gccctgtaat cattaaacac tttctctg 35 8 

<210> 126 
<211> 488 
<212> DNA 
<213> Homo sapiens 

<400> 126 
gtctggggag ctcctgcann anncntgnac 
gccatctgga agctcagggc natgagaaca 
aacccaataa tcaacanaaa cccagagggg 
cccttgaagg caggaactga gccttcattc 
tgtccctgaa gccaggccct ggctgagaat 
gctctatcac acaaatttca tccatgtgtn 
tgggctnccc cagtccctga cacagaactt 
ataactgttt acatgtcagt ctcctctctt 
cttaatgc 

<210> 127 
<211> 437 
<212> DNA 
<213> Homo sapiens 

<400> 127 

gtgaggncac acgtgnaaca acacgntgtn tgtgaaccat gaaagggagc 
accnnacctg ccacagcctt gatcttaacc tttgcngaag ncacaactga 
nnnntgtggt ttataaccca nccagtntat gatattntgc tncannaacc 
aagacnctcc ccaccatgan aatgtccaaa cataatgnga cagatgtctt 
gtggatgctg ngacanaggc ntttacaaac acagagcaac ccagggagct 
aatgaggctg gaaggaggct cananaatcc atctttccag tgaacttgga 
caagtggagc anaggggaga gaatntcttt gaaaacgcag ttgggagaca 
acgggaaaga aacaagg 

<210> 128 
<211> 438 
<212> DNA 
<213> Homo sapiens 

<400> 128 

attaaaaaga aaaaagaaaa tcaggtggga taaagagcct caggtctaac tgaattgtca 60 

actaatgatg gtctgagagt acctgtgctg aaatggaatt gtctttgagt ggacacttct 120 

tagatgagac ctattgtggc caatagctcc tgaggaactg aagccttcag ttcaaaactt 180 

gtgtgagaaa aatgaatctt gccaactact ggagtgagct tagaaatgaa tccatcccca 24 0 

gttgaccctt gaatgtagcc ttgtcagaga cccagagaca aagcatcctg ctaatctgca 300 

ctgggttcta ggcccacaga aaccatggga taataacttt gtgntgnttt taaccccttg 360 

aaaccaacca aataaaatcc ttaagatgtt cccctgngga agggttccat tggcagggat 420 

ctgcacttca caaccaaa 435 

<210> 129 
<211> 442 
<212> DNA 
<213> Homo sapiens 

<400> 129 

ggcaaattaa cccagaagag tacttcagag aacacagaca aactgccgtg cagtgaagag 60 
aatgtggcag gaagccctgg tattctagaa gaagctctgc ccactccaga caggatccgc 120 



tgagagttgg ctnangagaa 
acctgggccc tggtctctca 
aaacgacctc ctttcagcan 
cagcactaac tcaacaaaca 
gctgaaaaga ttcagagcag 
ctacccaagt gataccactt 
tttggtcacc aacctaatca 
cgtcccctga cagcagggat 



gatcaagagt 60 

agccaccatc 120 

gactgggaaa 180 

tttcctgagc 240 

atacacgtgg 300 

gctctttctc 360 

ttcanggatt 420 

atggntggcc 480 



ttcgacngac 60 

gaganna tnn 120 

tgaatggact 180 

tacatcantn 240 

gatcagcatg 300 

acaccagaaa 360 

gagccangta 420 
437 
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acgcctagtg ccatgtctat ctccaaggag atcacattct agagccaagg accgccactg 180 

agaagaaagt aaccgtgagc cgtcagaatg catacctgga gcgctccagg aaggaaatct 240 

cagccccggc atcctccatg gtcacacgga gagggcgggt gtccttgtag ctttggccct 3 00 

gagatgggag ctagagctgg acacagggtt ctagtcctgg cttttgtgga aacaagttcc 360 

caaacctggn gcaagngcct tacctgtctg ngtaatgggg ggagctgatg tggatcatct 420 

ttaagccctc tgcaagatgg ag 442 

<210> 130 
<211> 440 
<212> DNA 
<213> Homo sapiens 

<400> 130 

gaggtggagt cttgccatgc ccttccatta caaaatcctc ctgttccacc tgcaaaggca 60 

agcaccacag gtcagcagca gtcagtaact acaatgcgac tcactccaag aacccacacc 120 

tgccctgtgc agaaccacag ggccgtttca ctgtggggca cagaacagaa gcctgggcca 180 

atggttttca aacttctcct tgagtgatta gatctgcaga aaaaaggaaa catgttgatc 240 

ggcaaaacac ataactctga caaaggatta gcatctagaa tataaaagaa cggtgatgaa 300 

tcaatgagac aaagacagcc tactagaaaa atctggaaat aacccaagcc gggaatttcn 360 

ntgaagagaa cacataancn gttntaacat atgaaaagat attcaatctt atgtcagtca 420 

agaaaatgca aattaaaacc 440 

<210> 131 
<211> 434 
<212> DNA 
<213> Homo sapiens 

<400> 131 

gaagaaaatg ttaaaaagta ataaccaaag aaaaagtcag ccaactccca cagcctggtc 60 

ttgctgtgct gaatggcaga gaagatcaca gaggaagaaa aaagaaaaag acagaaaaaa 120 

ggaggcggag aatttcttgc ttaaactgga cctagtccag ctggcaagaa gaggtggttt 180 

tcttaacgcc tgcaaaacct gattactttt tttaaaggaa tgaagaagaa ggagatgtaa 240 

acacagccat taaaacagat ttaaggtact tagttttaat ctagtctaag accttttcaa 3 00 

ttgtatgctg ctctgcaatt ctctgcttgc tagacattaa tacngngcat aagcccntgg 360 

tcagngtctt ttaaccagng aacgctttca gctgagctct gnggttaccc tctcaggtca 420 

ggcatggaag gcct 434 

<210> 132 
<211> 437 
<212> DNA 
<213> Homo sapiens 

<400> 132 

gtaaacccag ttcactcagg cagaagcaag aggaagaaca ttcctccagc tcctcctcat 60 

gcaggcccga gaggtgggag ggcattctgc cagcccagta tatccacttt gcttcgacaa 120 

atgtcagcct gcccagaata aggaagtacc cacagccggg aaaggtaaat ccaaaccctg 180 

aaaagacaga tactgagcat ttgaaataac acagcttgca gcgtccttgc ggagccctgt 240 

ttatggggca ataaaccatt taaacgactg tgtgttggaa cccacaaggt cgccttgaaa 3 00 

ggcttttcac agacactgct agtagggctc caggacctct ngaaggccna gatngggggg 3 60 

nctttttgct tntgcttgaa gcttgntggt tcccctccat cangaacgcc agcccctgga 420 

gaggctgcca tgagaaa 437 

<210> 133 
<211> 341 
<212> DNA 
<213> Homo sapiens 

<400> 133 

gaagaaacac aagatttaag gttgtttgtc aactgacagc cctttctatc aacaactaaa 60 

taaaaaaatc tgtattccag aaacatgaca cttcatgtac cacccatttt cctcataaga 120 

aaccaaaagg tgtccatgac ttaggtacta aatggcaagg ctggaaccag aatccaagtt 180 

gcccagtcac acagttttgg tttttaaata accaaattgg tcaaaaatct tcctcaaaga 240 
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caaaaacaga tgaaggtaaa atgccaattg gttaaattta aacagagact tcactttgtt 300 
cttttcaggt tcaataataa acaattctag tgattagcat g 341 

<210> 134 

<211> 442 

<212> DNA 

<213> Homo sapiens 

<400> 134 

gagtaaacga tcccaattgc agtatatctg nggntcatct ggcttcttct cacaccacct 60 

ctgttgacat gggaggcctg ccggccacac atccaggaag tatgaaatca gcggggttcc 120 

tccccttctt gctccaggga agcctgagag ggactctgca gattgcattt ggaatccatc 180 

tgccagggag gggtaagaag aagcagagtg tcaccgggta agagtcgaca gttttgaaga 240 



ctcgtagctg cgaatctttc aggaaataat ccagaacagt ctcctcgctg gacaggaaag 300 

gaaacctatc ctagagaggc gaatcctctg tcctggaccc ctgccccana aaatgggtca 360 

ggggagggga ttntttgggg gngtttcnac ctgctgcttg cagggcttcg gttgccaaga 420 

gtttccccaa tacctaaacc cc 442 

<210> 135 
<211> 434 
<212> DNA 
<213> Homo sapiens 

<400> 135 

tctccatgct ctggatagag gaggttcaca agccagggcc tgaagattaa cagagctttg 60 

aagccaaaag gtgacccctg gaccatggac ttcgcacctc ctttcttaag ggctttaaaa 120 

tagaaaagaa caggagctag aagatgaggc agaagtcgag gacttctgtt tttctggaag 180 

gctcctctga gccaacaagg ccagggctgt tctggatttc agagcacaaa gaggctcctg 240 

gagccagcca tggtctcctg aggcttttac caacttgaaa gcagcctttc tccagggcag 300 

aaacgaagca tctccccagc gctcgccatc ctcagctgnt ctttacaaca agaactttac 360 

aaggatgccc ggatgaaggc ccaananacc cgcgttctgg gcaagccact tttaccacac 420 

cgactggatc cccc 434 

<210> 136 
<211> 433 
<212> DNA 

<213> Homo sapiens 
<400> 136 

gtacctaagg cagtaaaccc ccaactccct ggaagggccc actgggcgct cacttcgctc 60 
cagagcctcg cctggtttcc gcttcgggat ccggtcaccc aacccagctc tccagttgct 120 
gctgtttctc gtgagactgt cagagtgaag gggtccaaag ctccgacttc cagcctcaga 180 
aatcccaact caggcaggat cagcgaagcg tccctcgcag tggctggagg gagagccagg 240 
cggggcccag gctgccactt atcagggctg taaatgccac cctgaggccc acgcctgcca 300 
acactgctcc ccacaagact aagtcctgca gcctcagccc aaaaagaacc gggcctaacc 360 
ccaaaacgga nggtcatgtt caagccacac cccagtgaac cctggcgacc caccccacag 420 



tgccctgccc tec 433 

<210> 137 
<211> 443 
<212> DNA 

<213> Homo sapiens 
<400> 137 

gactagaact attgecactg aggggcaggt gggaagttca gccaactcgg aacceggagg 60 
ccccacctta cctccctttg tgaagagece agagectttg tccaaagctg catcacttcc 120 
cacccagccc ttcctgagcc aactccccga tgtctccaga agaacacagt cggcatcatc 180 
gtgataacat cagggaaact cctatttcca gcagtttctc cttcagctgc aaaaatgtgc 240 
agcagtagac agggcgtggg tttttgaagt etctgeagga ggtagagtta ttttctcagc 300 
accacatctg agegcatett ctaagggtgg ccgactgtgt gggaactgea agagcttaac 360 
cegggatgea agccctccca ttccccaccc tgtccactac caccacgcct ggatccgaca 420 
ggcagggcag gaccccatgc ccc 443 
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<210> 138 
<211> 405 
<212> DNA 
<213> Homo sapiens 

<400> 138 

gctctgggga gctcctgcat tannncntan ctgagtatca tccntctgcc atcaagaatg 60 

taagtatgaa gaatgttccg acactgctcc aggactgtct ttcaagccac tgacaaccat 120 

cctgcaaatt ttgatactgg tgcctgtttg gtgtccctag aggatctaaa tgaagatgtg 180 

aaaacaacaa ctaagaaaat attttaaatg gcaattactc aacacgagaa gttaaaacaa 240 

tgtccacact gagactgaaa tgacagcaac agaaacagca agtcagagcc atgcctgtac 300 

aatgacaact agatcaaaac tgccacctgg ccaaaagcaa tactcagatg ctattaactg 360 

taagacagtt aatggtatgt tatgaggtga aaaaaaaaat tcctt 405 

<210> 139 
<211> 448 
<212> DNA 
<213> Homo sapiens 

<400> 139 

ccnttttgat ccccacctac aactgggcat cgctaacaac ccatgtgagg tacctaggaa 60 

gaatgagaag cttccagcaa ggcagctgct tccagcagca agctcctgca tagcccacag 120 

gccattccag ctcaatgctg gagaagaatc ttccccctaa cagcactgcc cagcactacc 180 

caactaaggc ttctctggtt aaactgccca aggatgccca aagaacttgt ttctaaagga 240 

aggaaaacag atgccaagac ttcttgtgct ttctccaggg ggctcagagc aggccctgat 300 

cactaccctg gatgcacaaa gtatctatca aattcccaca aggtanaaag ggttgccagg 360 

aatgggaaga aacttcaata ttcgaagtca ccaatcacag aagataactg gcaaaacagt 420 

tctactaagc aagcacagag ccatttgc 448 

<210> 140 
<211> 458 
<212> DNA 
<213> Homo sapiens 

<400> 140 

aactgaggtg gtggtggtca agagcaaggt cgaggctcac ctgtgcccat ttggttccgt 60 

acattgctca ctagaggcat catcgacaga gtatgaatca gctccccaat tagcctgacc 120 

gtaatcacct gtgttgcttg attattatac aaattccccg acctcatacc gacctactga 180 

atcgaaatct ctaggagtag attctgggaa tctgtatcgc tggtaaagct cccaggtgat 240 

tcctataatc tggcaatgtg ggagacacga gcattaaggg aacccagcaa caggctccat 300 

cctctgccta acatcagcaa cctcagcaga gacttggtcc cagggaccct tgttccntta 360 

tgtaccccaa gacactgtcc ctaaatggng cacaaaagca agactcaggc ctgtctcaca 420 

cactggcaaa gctgctgccc cccagctcaa accagctc 458 

<210> 141 
<211> 451 
<212> DNA 
<213> Homo sapiens 

<400> 141 

aagcttgtga gacctcaatg agtcatgaag aatcctaatt tcaaatccaa agaatccaaa 60 

gtgatgataa caaaaagcaa taattgatat ctgaacaaag attcttgggc agccgagccc 120 

ctcttgaatt cctcagccta ccatcatgat caacacctcc catgttccgt ccatgaatga 180 

ccgcactgac agcactggag agatttaatg ggtcaccaat tgaggcagtg aaggcactca 240 

tggcactcag agctggaatg gggctgatct gagttgtact gttgactgca gtggtgatga 300 

caacctgcat tcctttgctg gctgcatcga caactgcttt gtnaatgggc attntaccgg 360 

aagcatcacc tggggccacc cacaacgagg ccatncttca cctgttgacc aagagatggg 420 

tcaatcctcg gttgcaactc acaaggtgtt c 451 

<210> 142 
<211> 450 
<212> DNA 
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<213> Homo sapiens 



<400> 142 
atcccttctg gagctggtcc taattgcttt 
tacctattca gcgaaggcac tccaagtcct 
gacttctctt ctgtagagat cacaggtgca 
tggaccccta ggagcggccg tgatttgtga 
acacagccgt ggagagccaa gaactgggag 
agggagaacg ccatgtggag gtgaagataa 
gaatgccaag gacctccagc aaaccaccaa 
tttctcacag acctcagaag gaaccaacca 



tcacaggagg gatgcaaact ggaaagttcc 
gggctctttt ctcctcgggg gcaaagatga 
tctgtacagg ttggagtgct cccccaaccc 
cacaaggccc cacccgttga tctactcttc 
ggaggaggaa atttggagac agagacacac 
agaacacaac ggtgcttntt acaacccaag 
gaagctcagg gggaggcaca gaacgaattc 



<210> 143 
<211> 452 
<212> DNA 
<213> Homo 



sapiens 



<400> 143 
tcagagttta caccttactg 
caacccctgg gcattccctg 
aaggagtctc ccagatgctg 
gcagaaaggc catccctcca 
ctcctactcc ctttggtgcc 
ttcaagccct gnaggcacct 
cttattgatc tgccagggtt 
agccaggctg ccatcacctc 



tacggctgac cacctgaatc 
ggcactaccc agcaaagccc 
caccagctgc ccagcgctgc 
ctccctggac agccctctcc 
tggtctttcc cagcagctgc 
tgactcctaa atgaatgaac 
tccacccttn catctnttca 
atgttccaat ta 



ccaatctcac gaaacaccca 
tatctttgca tcggtctcag 
tggaggaaat ctccaccgct 
acgtcaccca cctgggtcct 
ctacccccaa ctccctgcta 
ttaactgctt gccctgcccc 
gggcctgcct ttgcagcaca 



<210> 144 
<211> 258 
<212> DNA 
<213> Homo sapie: 



<400> 144 
ctgtcctgag agcacgtctc tacatctcta 
aaaacggaca agaacactag aatcagcccg 
tcccgcctat ctgtggtggt gggtatcatg 
anctcatcan ctntngttag cattagggna 
ccaanggggc cctgaaaa 



cctgcattct ggaatcaagg ggaaaaggcc 
tgtcccaacc ctttgactac aagggacttt 
aaaattatgc acaaaccttt ttttttttta 
tttnatntgg ggcccaggag cattnttttt 



<210> 145 
<211> 445 
<212> DNA 
<213> Homo sapiens 



<400> 145 
gcactcattc tctttcctgt caccctgtga 
cgtgtcctcc aagggtttca aggttatcgt 
agggaacttt gagtttgcca gagaaaactc 
agacatttga cttctttaat ttttgcatac 
agtggttgta tggctatttg caaaggctga 
ctttgatatt tgatgnttgn attcaaaatt 
tatgaattat gggggggcat ataanacttt 
tgccaaaata gntgttggtt tgatg 



agaggtgcct tccgccatga ctgtgctgaa 
atgccctgaa attgggcaag gagctttaag 
aagatgtttc tacatgaaga aaatggtttc 
tctttgtgat ggttgttagc aaagacctaa 
gtgtgacttg atattggctc aacttgaaaa 
ggaaacaaag gnggttaaaa agggnggata 
gcagaactta cctgcncctt atatattttc 



<210> 146 
<211> 437 
<212> DNA 
<213> Homo sapiens 



<400> 146 

gtttgcctgt ttcctctggt tccagtccaa gcatttgtgc tatccttcga gtctttacaa 
attgccctga aataatatgt gctgtgcctg cctctgtaca gttcagctca cctttgagac 
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atttcgttgc gtttgttcca acagcggtca attgtgttgt 
ctaacaccag cataccagcc gccctttctc gtgagcttgt 
aaagagttaa tcgatggata tgaattaaac acaggaaacc 
ctccaggcct aaaaccactt gtgactggag tgacgttaat 
atggtaacag gatgctgaaa cctgacacat acaaggnact 
acatttgatc actcttg 

<210> 147 
<211> 453 
<212> DNA 

<213> Homo sapiens 
<400> 147 

gcttcagttt aaaaggactg cctgtcctag ctgggattgg 
tgatcctccc gggacccaga gagatcagca gaccagaagg 
cccccaaccc aggaatccct gtacgacttg aggcattatc 
ggtagatgcc atcattaccc tcatttcaca cctgcagaaa 
ctggtctagt cacgagggat tctgtgatgc ctgagacata 
taagtgacag accaagaatt tgatcccatg tcctggnggn 
accattanag caggggtttc ctctgggggt tctcttgtcc 
tctggaaaca tttttcgttg tcacaaatga get 

<210> 148 
<211> 451 
<212> DNA 
<213> Homo sapiens 

<400> 148 

ctgaagagca ttgaccaagt tattatcttc aactctctca aaggggtgaa gagagaaaag 60 

caacactgag tcaactggct ggnttttcat ccctttctct tcttcagttg tgggctggag 120 

agagatgtaa ttccaggaca ttggccagcc ttttgttatg tggatacget ttacacaact 180 

acagtttatc catcagaatg aaatacagac aaaagctgag gaaatcagtc ttcttaatag 240 

atagaaagtg atcctttctg cctccaaata aaactgaatt ataacattct tegtatttet 300 

ctgggtacac atctggttta aaaattagaa gttaaatttt aaaagtaggc agaaggtttg 360 

gtttttagaa gaaaagacat tttaactgta atagnggatc attattttaa tgcttataaa 420 

gtccaatcaa agataaatgt caaaccataa c 451 

<210> 149 
<211> 351 
<212> DNA 
<213> Homo sapiens 

<400> 149 

cnaactgaga aaagcaaaag atatttgeca atgaacaata acctggatgc tcaaaggatg 60 

ataaccctga ggttgaggga taccaagtac cttgtccaca attcagcaac aatgggacag 120 

gtgtgataca aacctctttt tccatcttgt tctctttctg cttgaccatt gcaccattga 180 

gagaagtgaa acttgggctg agtctacaag gggcacccaa aataaccatg gtgtgtttat 240 

gttcatttaa aatcataaaa tttgtgtagg aaataaaaaa aaaaggceng egaggecnat 300 

tcagcttgga cttaaccagg ctgaacttgn tnaaaagggg gggcctccca a 351 

<210> 150 
<211> 244 
<212> DNA 
<213> Homo sapiens 

<400> 150 
ctctggggag ctcctgcatt nctacctncc 
agtggacttt tggccacgtg gatgaggaat 
aggcgcctgc tgaggacagc agggctgett 
agcactgcct gatggectae cctggactgt 
ttcc 



atttacccca 


gaaatcactg 


180 


gagtggttta 


eggagcagaa 


240 


agcactagag 


gaacctcaga 


300 


cacaaganaa 


gggagcctcc 


360 


atgcactttt 


caaagcactt 


420 






437 



agaattgaga 


gaaaggcatg 


60 


cctacatgta 


cactggaaag 


120 


tcactgtgca 


tggctgaagc 


180 


ctgaggtata 


gaaacattaa 


240 


tgacctgccc 


tccaagacca 


300 


cccacaagnc 


tggggccttt 


360 


ccaggggaca 


tttggcaaca 


420 






453 



ttnagatana nctgnnggct 
tgaagcagtc agttctgatc 
ggcaccctgg gtccctgaat 
tgectgagae agaaataaac 



ggaatgtana 60 

tagagatgga 120 

ggctctgtgg 180 

ttttatcttg 240 
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<210> 151 
<211> 573 
<212> DNA 
<213> Homo 



sapiens 



<400> 151 
gttttcaagc aaantggcng 
gggnccgtgg ctccnaaagg 
gaaaagcaat gngtttacca 
gaagcttggg ggaaaaaatt 
aggttgtggg gggggtggga 
gaccttggga aaaaaaatgg 
tttgggtttt caatttggta 
canttgnaaa gaaaagccan 
atcctngggg gaatttcttt 
taccnaaaat aaaaaaatgg 



taattggaag aaggnaaaac 
tnattcggtc cccgggtttc 
tttgggcgga aattttgaaa 
tgtttgttgg aaacctcaca 
tcaagcccca agaattttga 
gccaagaaat aaaatcttgc 
cttggaccaa ccttcaagct 
ggaaccgaaa aaaacccccn 
tttttnttaa gggggatggg 
ccctcatggc aca 



gcccagggtg ccttaattta 
ntcaacttgt ngaatggatg 
aatcattgga tggaccacaa 
agggcaaggg ctaaaaacaa 
ccgtngccaa acctcaaaaa 
tttccatccc cgcccaaggt 
tgggcanttc attngggacc 
ccnngggang ggggaaaaaa 
taaantacca ttattatttt 



<210> 152 
<211> 845 
<212> DNA 
<213> Homo 



sapiens 



<400> 152 

gctacgatgc tggnntaaat ctttggcntg gcttggctca cttcttttgg gggtccacca 
cttggccttt tattgaagct tggtaancac ttcnaccant ggaanggggt cttggcaagc 
tttcacttcc ttggaaagcc caggcggaag aacccacaaa aaccccaccc gggganggaa 
atgaaacaag ctggcaagga acgccgccgg ccctttaaag atgcctggta aaccacttca 
cccaaggaaa gggtcccgca agctttcact tccttaaaag cccaagccga agaaccaagg 
gaaacccccc acccaagaaa gggaaaaaaa aactcccgaa acaacatctt gaaaccatca 
agaaaggaaa caaaacctcc cgggaacacc gccttgccct tttgaagaaa cttgtgaaca 
cttcaccccg tgaaggggtc ccgccggctt tcatttcctt gaaagtcaag tggaagaacc 
aaaaganacc cacccaaatt cccgggacat tgtttccttc actttccttt taataagctt 
aatttaaaat ggtgaacttt ttctcggagg ggttgggctt tttggaccat tncttttggg 
gaaaacaagc acttccttaa tcaaattggt caccctttnc ccttgctttg gggtttttgn 
ttatttaanc cactttattt gggccatctt cttggggcca naagaatttt attaagccnc 
caatttaaaa tantcccatt ttggcttacc caagccttcc ctttcattat taaccccctt 
tgccccaatt aangcaaggg nccccttata aaaccaaaat nnggggcttg nggaggccaa 
aaaaa 



<210> 153 
<211> 582 
<212> DNA 
<213> Homo 



sapiens 



<400> 153 
gtgcctgtct gaaaaccagt tcctctatga 
ctggaagcca gactagtgat atgcaccttg 
gaatccagag caactttctt agctggagtg 
atacaaacag tggggctctt tgcaaaacac 
gccgaaaagg acatttctgc ctttctctga 
atcctcccta tacctgaaga aaaggaacat 
aagaatctga acaaacaggc cttgcaaaat 
ctcctttttc cggccatact tctccataac 
aacccatctg gtttactggn tggcttgggt 
taaaaacnta cgttaaaaaa aatggggatg 



ctgtgatctc caagtgatca aagtcttgtc 
taccttgctc ctcaaggcac caacaaatag 
gcttctatgt ttctgactgg actttcacgg 
tcttctaagc tttcagaagc aggtcataaa 
agcaggtcat aagtccctca ttagagaagt 
ccttatctat gaagacacag gaactcagag 
gccctccagc ttcctgccat tagatcatac 
tatccacttc ttcatcagat ctagcataaa 
cttcatttnc ttatgaangc tccgcatacg 
cttttctttg gt 



<210> 154 
<211> 627 
<212> DNA 

<213> Homo sapiens 



<400> 154 
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atgcatcagc agaacctacc acacggcacc tactgcgggc ttcagttttg ctgtagaacc 60 

gagaaacatc acgttagatg ctttagcaac aacaatgtat atgttgcata gaagaaaagt 120 

gtcccagaag aacagccagc tgtcctttac atgaaattgt ggcactgcct gtaagaagta 180 

tatccaatga gaacttgtcc tcaccatgta atacttttaa tgggtgagcc atttcaacac 240 

tttacatact gccgagtaag tttctacaga actttctcat tgtactcagc gctgtctgtg 300 

cagttaattt aggcatcaga aaactcagtt gttaattttc tgacttgcct ctggactctt 360 

aaatgctatt gctccaatca taacacgtcg gaacacttac gcagatttca acaataatat 420 

ccacagctgg gaataaatca aagcaggttt atcactggat aagtgctatt ggaatatggg 480 

taccaagaca acatgaagca aaggacagat ttcactttag aagattaaga cagagccctg 540 

ggggggaaaa aaaagaggta atcccaacaa agtctatgca accnttaaaa aatattattc 600 

agagcagaaa tgcagaattg gcctttg 627 



<210> 155 
<211> 598 
<212> DNA 
<213> Homo sapiens 



<210> 156 
<211> 284 
<212> DNA 
<213> Homo sapiens 



<210> 157 
<211> 759 
<212> DNA 
<213> Homo sapiens 



<400> 155 

caaaactgaa aaactggntg accttncgct tngnntncaa caaaccaaga ctagctttga 
ctatgacaat nggtatctaa ngaatgccag acaggatgga tgaagaccag gacacaactc 
actccaccaa actgtgatgt tacgtcattt accttggtcc ccacccactt tgcctttgaa 
tgaagacgtg tccccagcnn ttgganaacg agaaggaaac acgccaaatt aaggtcnnat 240 
ttacatcaac agagaatata gaggctcaag agaggaattc acttaactta taggaaaacg 300 
aagtcatatt ttggcacatc gagtttgtag tctttgagaa atgaaaatcc tcancaaaaa 360 
gcttttgtct gaccagctgt gaggtaagaa tgtgcaagaa gtcaaagcaa gcgaggaggc 420 
ggagccggta ctgtcctgga aagcaaaacc cagaaaggtg gcgaatctgc tccaaagctg 480 
cctcttttct gctcctaagg aagatgcntt ctcangatac agggattttg tgtatgaaaa 540 
aaaaatggcc atagctgctt acagaanaga atgggtggna atgccaattt ttgactat 598 



60 
120 
180 



<400> 156 

aacctcaggc caagtgttct tgacagctca tccacagact cccactggta aagcagcatg 
aggatggctt ctgttatttt atttcagaat tttttcctgc agtggcatgc cagtaccagc 120 
tgaggatcat gtatgcaata tttgccttct ttcatcttct acctaggatg gctttaattc i fin 

tcttcgagga gaatttattt tagtttttcc cagtaagaga atccacttct cttgcccata 
ttcataaatt atcattaaaa attaaacttg gtacaataaa tatt 



180 
240 
284 



60 



<400> 157 

ggctaccctc gtgntganat gaatnaactg gcncctggng gccgaaaagc gaggngccnc 
tttgttttgg gagggncccg taccccgcgg gaaacccttt tttgcccgaa ccaagcccaa 120 



gcggaatggt ttggtcttcg gcctggccaa ncnaagcccc cccaagangg ggccaaagct 
tcttggtgga aactaagtcc caccttgttg cgggaaggcc cgggggtcaa ncccaaaagt 240 

360 



nccccggnca nggccaagca atcggtcatc gggggcccta taagcnggga aaagaaagaa 
aaagccacaa gncaaagtat cttggcttga aaaaaatggg ggggnntant aaacgggaag 

tcttcgcccg tgtcaccaag gcttgggaag tgtgccaagt ggatgaagaa tctcagctca 420 

cttgcaaacc ttcacctcct tggggttcaa aagtggattt ctttcttggc ttcaaccttt 480 

tcccaagtaa gcttgggaat tacaagggcc ccggnccacc atgcccaagt attttttggt 540 

gggccaagaa gggangggaa aanggaaagg ngggggtacc ttggaaaacg aacaagcttc 600 

ttttcccctt ggggaacttg gnaagcaatt nccgaagcac caacaagtcc aaccccggcc 660 

aagccttttt ggtttccttg gcacaagtct tggncntntt naaagaaacc aacnaacttc 720 

cattattttt attggacgaa tnaaaaaaat ttgggtagg 759 

<210> 158 
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<211> 501 
<212> DNA 
<213> Homo sapiens 



<400> 158 

tcagaactng aggcnaccct tgccaaggnc nnctancccc ttgggggccn tnactttngc 60 

cntaagggcc ntntngncnn caancccttg acnaaactta anggagtccc ntcgaaaccg 120 

gggccaccac ctttcttcac cttttgcaag gcaaggaagg cccggaaggg ntaagccctc 180 

aagcgtcaac gaagttcaaa agancttggg ttacccagca agtttgcccc atctgctcaa 240 

gggatgtggg ccttcttctt gatgaagtaa gttgaaagtg cttgggatgt gaaatcaagg 300 

aactcggagc tcaaagttca atgaagtacc ttggaaaatt ggattgggga agctggccca 360 

aggaaaatca ggaaagaaaa naagtcctga agattcaagg aagaaagtaa aagcccgcct 420 

ggcttganaa tgggggtggg ccanggccaa accttgatca agggcccgag caaaacccgc 480 

actctttcca aataaaagct t 501 

<210> 159 
<211> 736 
<212> DNA 
<213> Homo sapiens 



<400> 159 

gntaccnact ngnacccagt ggatnnatca ancacgaagc cctcactttt gacntcttng 60 

cannngngna aaatttggag ctgggatttc attgcccatg ggcaagatgg ggaananggt 120 

tancctttgg cttananaca aggangggaa aaacccaann ctttnaccan aaaagaaanc 180 

ttgganattc tttggggttc ttggaacang aaccggtttt acctgggcat tttttttaac 240 

aaaaacnacc ctttaacttg gcttatttaa cccggcctgg cttcaatcaa cccacccttg 300 

gggccctggc ccccaagtgg gccaatantg cccttcaccc aacctattgg gcanttaagc 3 60 

ccacaaggcc caaagaataa acttataata tcaanaaatg gaantaagaa aagaaaaatg 420 

tggttcactt gggaaaaact tggcttggtt ggaagcccct cccaatgggg gaagcttgaa 480 

ggagcttgtg gtctcttgca aggccattgg ggggaacttg ggcccacaaa gccaaaagaa 540 

gtcaagcanc catggaaagc ccccnggagc ttgtaaccgg tgtgcaacca aggccgccca 600 

attccaaaca agcatggggg aaaccaacaa gtnggncgcc aaatcatttt nctcaattta 660 

ttngggcnaa aaaaggnngc tatttttttc acccttgggt aaggtggtng cntttttgga 720 

gaaacttccc aaatta 736 

<210> 160 
<211> 458 
<212> DNA 
<213> Homo sapiens 



<400> 160 

aagacataca tcatgagaga gagagattac agtatgcaat ctctcagctg ccaacagaac 60 

acagatgggc ttgggaacag agaatgatcc agatctgcag gactggagca atccgtggga 120 

agtttggaca gaagatctga tgcataagac agtaaaggac tactgaatgt tccatgatag 180 

atatgcttgt tcttttgcct gcatgccctt gaataaagac attttgatct. ccaggaccaa 240 

cctgagaaac atataattta atctagtttt gaaagaagag ccctgctaca caaatactgg 300 

ctcacaatgt taacagatat caactgaaat atcaaagggc tttcatattt cattaaattg 3 60 

actatcctat gtgtttgata tttccattta attgaatatt tcttaactca atgaaaaatg 420 

tatgagcctg ctgtgataaa tcccgtgtcg catatggg 458 

<210> 161 
<211> 264 
<212> DNA 
<213> Homo sapiens 

<400> 161 

cagaaattga gaatcatttc acttttgggg gaacgggaag ctggttgtgn accaccctta 60 

tgtgnacctt cctgtccttc agctacatcn gatgaacctt gggcagtgaa ttatctaagt 120 

cccatccaag cttccagaaa gaactgcagc cccagctgac agcttgactg caacctcatg 180 

aatgtttctg agctaggacc acccagttgc ttctgaattc ctcaccctca gaaaactatg 240 

aatacaataa atgctgatta tttt 264 
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<210> 162 
<21I> 882 
<212> DNA 

<213> Homo sapiens 
<400> 162 

agtcaganac tngaagccca tactttccca attgccttcc aagcttgttt gcaccgggan 60 

ggtttcaaca atcantattt ttccaagaaa nggcttcctt gggaaaagan ngtggaaata 120 

ttggtggtcc ccaatccaag aaaanccttg aatggggggg anttggtgaa ctttgggctt 180 

gcttgtccat tcctttcaat ggtcaagccc caananaaan atctggtggt caagccccgc 240 

cacaaaccat tacttggttt aaagccaagt ggggaatgaa aaagtggcca aagccttgcc 300 

caaagaaaaa aatgggtaaa agggaaaaat gtttgccccc aagggaaaga aaacacccat 360 

gggcaaagat nggaaaccaa gtaaaccagg gggccacaat caaggggggg anaacaccga 420 

aaacattacc gggcccanta aaaacttcct ttaattaaga ananngtcta ccaagattaa 480 

aatctancag atgaacanat tcctcaaagt tgggaacttt gggcccattg aatttgggnt 540 

tggtcccttg ccattactng atggaaaact actggatggt ccaagcttgg gtctgaaang 600 

gaccccttac ccagaaagcc ttaaattcan tcaaaagaaa atggcaaatt tcccattatn 660 

cctaaatgga attcaaatct tccctttacc ccttggaccc caatcaaggn ggggncccaa 720 

aaatttttcc caacccccct ttggccttcc ccaaaaaacc ccccaacccc caanaaaccn 780 

tcttttaaaa aaaattaaag aaatctttcc ttccttaact ttccttggac ttcaanccnn 84 0 

cccattgtna atccatttaa aacctcntnt ttgcttggaa aa 882 

<210> 163 
<211> 828 
<212> DNA 

<213> Homo sapiens 
<400> 163 

cagatactga gaacacaaca aaaagaacct gtcaccacaa caaagagggg aaagtggacc 60 

aagtggctta tcttgaaacc ttgtgggtcc ttggggaagc ccaggggtga accctgaata 120 

atgaacatct aaaaagaaag cctttctggg aacttcttga aacaaagaaa tttcggtggg 180 

ccctgccaaa agctttgccc aatttgccac ttttttcaaa atgccctttt gggaatgaac 240 

ccaagccact tttaaatctt gaaaaccttg caaccaagaa ctaagcccaa ccacctgggc 3 00 

ccatgaaaac tttgccccct ttcacttgga tctgggaact tcaaccttct tggancccta 360 

acggcttttt aaagccaaag ccacttaact tggcactttt aacaagaaat taaccccaac 420 

ttgggaatcc cttgggaacc caacaagaaa ttccctttca aggaatccct ttctttggct 480 

ggccaagaat ggaaagccaa aagggaaatt aatttccccc ttcaaagttt ttctaaagtg 540 

aatttccaaa aagccaaang nggngggtgg aaaatttccc aagtaaccaa gaaaaccaag 600 

aagggttggc cccaatagaa agtaantttt ttaatctaat aaccntcccc tttgggtacc 660 

ctagaaaaaa ngcttatttg agaactaatg aagctccacc agaaccangg gcctttcgcc 720 

ancaaaacct ccaaaatcaa taaattggga ccatggtttt aaatggatta cctggggaaa 780 

tccntggata ggccctnnna aaaaggggga nangctaatt aaaacaaa 828 

<210> 164 
<211> 660 
<212> DNA 

<213> Homo sapiens 
<400> 164 

tggagaaaat gggattggga aacagaaggg agaagaaact gggcntttac cataagaagg 60 

ttgcanaaca ccctttaaaa acctaacctt ttaaaatggc agtgggaaag cnttcaacat 12 0 

ggaggcctcg tctaatttaa aacaaaccac acagacncac ttggcccaaa agcagcgact 180 

ggcctctgaa gannaaaagg tggggccctg caagtactgg gctgggaacc acctccacat 240 

ctgaaagaat gctgtttgcc tgtatttgct tcccaacgtc cttccttccc ttgcctggtt 300 

gcctgttggg cctaacatgg agctctgccc acagtaagtg tcgttactat ggccactagc 360 

ccataccaag gcatggcctt tgcaagtccc caacatacag ctcccgacct cacaagcaag 42 0 

nccatctcta ntgctggnca gaaagtaaaa gttcacacng ggcggggcaa aaagtcctgc 480 

tcattccaan gnancaacgc accctnaaca agcttttccc aaaangcaac tcaaccactc 540 

tttagaattt tttttttttt tnaaaaaaaa ccgggnttaa ggaacttggc aaaaaaaanc 600 

ccccnagntg gaaaanccct ggggaaaaan tttctgggnc ccccccccgg ggctgaactt 660 

<210> 165 
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240 
300 
360 



<211> 643 
<212> DNA 
<213> Homo sapiens 

<400> 165 

cagaaactga ggtatattag ttcttatatg aatggacaga agaaacnatg gaaattggag 60 

ggaagggaag angaacncnt anangggngc ntantttngc nncccaggtn gnccttcaat 120 

taaaagaacc tttggcntcc agggttcaan gtggattctt tttgcttcaa gccttcccga 180 
gtaagctggg gaactaacag ggtggtcaag gccttcttga ccccaagcct aaagcccatc 
attatccccc tggtggatct tgcacctaac ccatcccaga atggccctga aagtaagtga 
aagantcccc caaaaagaaa gtgaaaataa gccttaactg gatggcattc ccaccattgn 

gaatttgttt ctgccttcac ccttaactgg atcaatgtac tttgaaaatc tccccgcacc 420 

ctttaaaaaa ngttctttgt aattctcccc ancctttgaa aaatgtactt tgngaagaat 480 

ccanccttct ggccgcaaaa cattgctctt aacttccacc gcctatncca aaacctataa 540 

gaactaatgg ataatccacc accctttgct tggacttctt tttcgggact canncccgnc 600 

tgnaaccccc ggtgaataaa aacaagnccc cttgtgtccc ccc 643 

<210> 166 
<211> 629 
<212> DNA 
<213> Homo sapiens 

<400> 166 

tcaganactn ggagngaaga acaagctttc ccaagggctt ggaaaagaag gggggaagtg 60 

ccgggaacca ntgccttccn ccantaacca cttggcccac ttcttggtgg aaccttcttg 120 

gcaagcaaaa aaccctggaa acccccaaaa gaaggcaagc tttcttcaaa aagtaaaaaa 180 

gtgggaaatg gaaagtttcc ctgggtggaa ccctggaaat tccccatggg aagggaaaaa 240 

gatngganaa aagggancat ttattgccaa gggaagantg ggcatctcgt ggtccccttg 300 

ggttgaaacc caanattcca ttaagggaaa gaacgggtgc caagttgttg aagggtgggg 360 

acccttggga cccttgggaa taaaaaatgg gggtggttta aaccaaaagt aatttgtttg 420 

aagtaagggt tgggtgggga agggaaggca ccgactaaga tgcaaggggg tctaagcttg 480 

aagttggaca aagaagctaa ccaccagggt tgttgggacc aagggacagg ggggggaccc 540 

tttaaagccg aaaagaacac cctgcccaag atggtggtct ttggttccct ttgacctggt 600 

gggagaaggg cccctttggt gggggtggg 62 9 

<210> 167 
<211> 276 
<212> DNA 
<213> Homo sapiens 

<400> 167 

ggtgaagcca gatgggagtg ctgagcttca gggagcagct acgcaaagtt aattgtgctc 
agcaaagtct tctagattaa gcggtcgctc caataaagtt tcctgattct gtccagaaat 
cctcaactcc gacaataaga agtgggttga ggggcagttt gaatacataa tcaaaaagca 
tataattgaa gattgaactt gagctatagc ttcatgtatt gtctctgcgt tgttctattt 
taatagttgc atatggagac aataaagcta catgac 

<210> 168 
<211> 299 
<212> DNA 
<213> Homo sapiens 

<400> 168 

agacgtctgg ggagcctacc tgcattaagt ccanatactg gagagaaatt caagaacctt 
ggaaagctta ccccaacctt tcttaaccat tggcctanta accnatggan caccccttaa 
ggaangtggg gcaggaagta acccccggan ggggaaagaa acccctgggn taaccttgga 180 
aatggactan tattggaaaa caacanggtt ggcctttana taaccccttc ggantcaact 
tcaacttaac nggaaacttc ttntaaataa aaaggtanta atttttttaa agcccaatt 



60 
120 
180 
240 
276 



60 
120 



240 
299 



<210> 169 
<211> 540 
<212> DNA 
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<213> Homo sapiens 
<400> 169 

atttctgtga atagaccaga agcccgacct ttacagtgtg tttggggtgc agaaaacctt 
ggctgacata ctcaaggctg aaatgcagtc agcggaaatg gaaacacttc aactctgccc 
ctgtggcaag aatggcttcc cttcagacaa tctggccaga ttctttatgg acccaatggg 
agaaattgga tgcttgtata tacctctcag catctttgaa ggggcactga aacttcaatc 
aaattgggga aagggagccc tgaactttag acctgtttta aatgtgcaga gtggcaactg 
gcacaaggaa cactttccat ctgtaagaaa gaatacaaag aacttggaac aagaaaaaag 
tagatatctc atcagtcaat ggtgctgtat aggcatgcac aaagatggag atgtgagcac 
cgacaagatg gctggcatct ataaggcagg aagagatacc tcaccagaac cccataatgc 
tggcctctga cagtaaaatt ctanctgttg nactatgaga aaataaaatt ctgtggttaa 

<210> 170 
<211> 381 
<212> DNA 
<213> Homo sapiens 

<400> 170 

ctgaatgaag acaaatctta gccctctgag actgatggtc tcagaaagta gtcttcagat 
taccagcttc agaatcagct gatgggttca ctaaaatgca gattcccagg cccagtgagg 
actgaataaa tcttagtttc ccaggcttta caggaaccat ggtgctcagc ttctaaggag 
gcctcaggaa acttacaatc atggtggaag atgaagacgg agcaggacac agagttcacc 
ctctctggag aatgtagcca ccaggcacca tcttggaagt gaagactgga ccctcatcag 
acaacaaacc tgccagtgcc ttgaccttgg acttcacttc ccagcttcca gactgtgaga 
aaataaactt ctgttcttta t 

<210> 171 
<211> 334 
<212> DNA 
<213> Homo sapiens 

<400> 171 

ataatgacga ctgcaaaatg gcaggataag gaccgtccaa aaagcctcat tgatgaaagc 

aatgagaacg ctggcaaaaa tgatcagaat cggctttttc agacctctgg aaattaacca 120 

aagatttgca gtgaggaatg aaatttcagt gaaaagcaat atcctagcag ccactggggt 13(1 

ggagaactga agccgagctc ccccaaagcc tcttcccgga gaactgtcat tatctgagct 

gcctctctgt tccgtggaag actctacttg caagactatc tttgcctgat tgactcggag 
cttaacccgt aggaacagcc caggggcatt tgtt 

<210> 172 
<211> 351 
<212> DNA 
<213> Homo sapiens 



<400> 173 

gcatacctca agatcagttg aattggagca cagctggatg gaggcctcag gttaattaac 



120 
180 
240 
300 
360 
420 
480 
540 



60 
120 
180 
240 
300 
360 
381 



180 
240 
300 
334 



60 
120 



<400> 172 

aacagttcta gatctccatc gttataaaag agtattaccg tgttggtgta ccacaatttc 
tcaagaaaaa cattagctaa gcccaagctg gattttgatg gataacatgc tgatgttgta 
acaaggctgg agcgtggcac atctcacaca tgcaggtgaa cacccaatta ccacgcctat 180 
gaactacaaa atcatctaag cagattttaa attagccagt tgtttcccta ggatcctcca 240 
aaggtgatca atacagtttg tttttttctt ggtggaggga tctcatgatg aactaatgaa 300 
tcttaacatg aattgtaagc aaataaataa aatggtatgg tttaagccat t 351 

<210> 173 
<211> 376 
<212> DNA 
<213> Homo sapiens 



60 



gcatacctca agdtuciyi.Ly a.o.i_i_'yya.yv_c» ._._^._>_aa~-» =>— = = too 

ttcctttgag agcatccaga aaattagcaa ggacatgaga aaccattcac tcaggacgac 120 
caatcagcca ggacactccg aaacctatta aatcagattt ttaatcttct aagcctgtag 180 
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acaactgtgt gacatcagcc acatcctcaa 
atgtgtgcaa ggaatcatgc ataaaaggat 
atttgtcatc aaatgtgaac ggagatatgg 
agtgaatcac ttcttg 



atcttaaggg aaacacgaat acaagaatac 
tgtgccttca gatcaagtcc aactgttttt 
gtactagtcc caggaatgcc ataaactagc 



<210> 174 
<211> 513 
<212> DNA 
<213> Homo 



sapiens 



<400> 174 
atatgtattc tgcaatcatg accaaacaga 
taaaaaggct acttgtcaac cacgccattg 
gctgcacatt cctgggacgt gaataatatc 
ttaccagccc tgtgaacact gaatataacc 
gggatgagtg ctggggcagc agaggtccac 
ttcagacaca ggtttggatc ctgcagggct 
ttccttgtat aatcaccagt aagctgagaa 
actcctcttt aaatgcattg caaaggagat 
caaataggta ttaaagtatt aaatgaagtg 



aggactaaat ctggatcaga atctgaaatg 
ttttccgttg gagctagcag agcagcctcg 
ggttgtgatt acacttcagt atctcatcca 
taattaggaa atgcgaaggg ccctttgcta 
atgccttccc gacacaggga ttcaccgggt 
caaggacaga ctttactggt ctagtccaca 
tgtgacacct tggattccat cctatgttac 
atgccaggac ttgataagtc aagtcaattt 
att 



<210> 175 
<211> 432 
<212> DNA 
<213> Homo 



sapiens 



<400> 175 
gtatgttgca ttgtacaaga tgaagttaga 
gccagaaaat actgcccaac cagctctcaa 
ttcaaatgag aggtggagac ccagctggca 
aagcagaatc tcagactggc aagatgcaag 
gccaatcttg agcaactcta gaagagaaac 
aatgaactgg ttttcattac ttcctgagtg 
aatgtatacc attctcagtt gtctatttaa 
tatctacttc tt 



gtgtgaagca tggaacaaag tgcttattga 
ggcaaagaga gggtgtacga gaagctaatc 
gctagcatgg tgcggcgtgt tggaggcaag 
ggcaggcagc ccacccacag ggaaggcgtc 
ctgaacacat cagaactcaa actaactgat 
atcaggaggt agaattgtct cttacaaccc 
ggatttctta gtgagctcca tggtaaaata 



<210> 176 
<211> 387 
<212> DNA 
<213> Homo 



sapiens 



<400> 176 
aggggcagac ccaggtggga gtactgcagg 
tcttccgata ctagcaaggt gtgcttggca 
tgacatcact caaggtgtga ggacccagca 
agaggcccat ctgaggagcg gacaggcagc 
gtctttgagc cattccctac tctgagtcac 
agcttcccat tcctcatact tcccctgatg 
aaatctggat tattaaagtc aattttc 



ccacgcccct cgaagacagc atccacgtgg 
gccggtgcct caaggattgt tctggaagga 
gacagagcac acgccctggc tccatgcccc 
ctttcccacc agagtcacca gggtgaggac 
aacctcgtag ctgattaagg ccacatggga 
ctctcaggaa ggacaatttc gggctgaacc 



<210> 177 
<211> 420 
<212> DNA 
<213> Homo 



sapiens 



<400> 177 
gttgctacaa taattccagc tgtgtatacc 
aagcatctta ctgaagaagg cccctacgtt 
gctgtcccac acaatatggg ccaggcgatg 
tcaaagtgaa gctggtggcg acttcagagt 
ataacctttg taaaagtaaa ctatgatcat 
actgcttttg gaacctgtct cagtttttat 



tcctgggatc ataatagaaa tgaacctctg 
gactgtccag ctgactgtct ctacccgact 
gtattgcctt tgcaaactaa atgaagttcc 
taacttttca aatggccggg cttatataga 
ataataagat acatgtgcat ttggaacgcc 
catcatacaa ggttaattgt ctaatgtcaa 
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ttagatttta tcacaagtgc atttgggtcc taatctggaa caataaaagt ctattaaacg 420 

<210> 178 
<211> 421 
<212> DNA 
<213> Homo sapiens 

<400> 178 
ggcatcttga agcagaccag ccacgttgca 
gctgttctgg gagacagaat cctatagcat 
ttccagcttg atgtctcagc cagcgggttc 
gacagagtga cccaggtaac caggaaaaca 
aacctcctag attccaagga ggaaaacgta 
catctgaaag ggagtccggg gaattgctaa 
ctggatgtgg ccgcctgact cgaattcctt 



<210> 179 
<211> 115 
<212> DNA 
<213> Homo sapiens 

<400> 179 

aatacgttcc agaggacaag gactgtgttg ttcatcacag tattccagaa cttaaaagga 60 
actggcacat aattggagct tactaatatt cgtcaaaaaa atgaacaaat gaggc 115 

<210> 180 
<211> 449 
<212> DNA 

<213> Homo sapiens 

<400> 180 
ataagagtga gcatttttgg aaatgtgatc 
ggaagaatca ggaggatatc ttagaagata 
actaccttac acctggtttc cacaggagaa 
taggaaaaat gtctatggtc tcttcagcac 
tctcatctat gaaactggga taataatatg 
atagcatgtg aaagcagttc tagattccag 
tctcaatttt cttgtgtcat tgctgctgcc 
gccaagtaat aaactaacac atctaatcg 

<210> 181 
<211> 506 
<212> DNA 

<213> Homo sapiens 

<400> 181 
gtgattttag aggaataaac acccttagcc 
caagggaaag gagctcactg aaggcccatg 
cacagcctct gttcccaggg ccacgctcct 
ggacaagaca tccttttttc ttctagaaca 
cattgatact ttttcaaagg cttcacaact 
tcattatctg gccaacttta agaatgaggn 
cagaagactc tgcacacttg catatctcac 
tcctcattgg ttcacaaaaa aaaaaagggc 
gggtggaatt tnttaaaagg gggggg 



agtgcttgga ggcacggatg 
ccccagtcct gcagcacaca 
ccacgtcctc cccgcctctc 
aggccataaa aaaggaactc 
gctctcagac caagtccgtt 
ttttgaactt tctatacacc 
tgcacaataa aatgaggggg 



actggtggct 60 

gg tgggacaa 120 

ccaggcagaa 180 

ctactaatga 240 

ttcgcccttg 3 00 

cttcctgcct 360 

aaaaaaatca 420 
421 



aactgacgca aaatggcagc 
accacagaat ctttgcaaga 
atggctcaaa atatgttatt 
catctgtatg tagtctctga 
caatgagagt tattctgaag 
acataagagt aagattaaaa 
atctagactt aaacaaatgt 



aacactggaa 60 

gacacagaag 120 

agttgaacag 180 

gtctccagtt 240 

atcaaataag 300 

gaaatgttgt 3 60 

tactgtaaga 42 0 
449 



gtcagccaac attttacaaa 
ctcattaatg aggaagcaaa 
cgatttctaa gcgctgttcc 
acagctcagc cccacctgaa 
cagctttttt ggagacttca 
ttgctaaatg tatcagcatt 
aaataccgnc aataaataca 
ggccggggcc nttnancttg 



tgaaggccag 60 

aacaacagca 12 0 

agtccacaca 180 

agaaagagt t 240 

gcaaaataag 300 

ctgaggntat 360 

tagnttcatt 420 

gacttaanaa 480 
506 



<210> 182 
<211> 510 
<212> DNA 
<213> Homo sapiens 
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<400> 182 



gccccagcgg atggaactca taaataaaga gtgagaaatg caanttatgc cagangttag 
aaagccaggc tccttgccac agcaagaagg ggatagctgc agcccacgga gaaggagaac 
cagtaaagtt agcaaaagca ggcagaagaa gtttctaaag caacatactc tgcaaagcag 
tctgggccac gtactgtagg agcaagttgc cagcagcccc cgggagcatg aatggatata 
gcaactgttg ttgaaaaaga acaatcctga tcaacccaca tcaaaggcta atagacctca 
tttaagaaga cagggaaatg taaatctgtg agatacttca ggatcatttc tatcaaaaag 
cgtttcatat: aataaaggaa taaagcctca gttatctgga agggtcnnnn nnnnnnnnnn 
nimnnnnrian nnnnnnnngg gggccggggg gggccctttt ttttngtttt aacccggnnt 
tntttttttt aaaggggggg ggccccccca 



60 
120 
180 
240 
300 
360 
420 
480 
510 



<210> 183 
<211> 379 
<212> DNA 
<213> Homo sapiens 

<400> 183 

gctcggtgac taggaagagt ggctgaaagg ccccacctct gactcctccc tgcttctgat 60 

agcctgagtc ctgggggaca gagggaagcg cctctgggtt cccctctccg tgtgaggcag 120 

acagcctccg cccaggctct gaggggccct aattcttcct aacagacagc agtttggagc 180 

ttctcccaga gtgacccagg agccagccca ggagtggtct agaatagaca aaggaccgtt 240 

agtatcccga tgtgaatttt agaatgtgta tatttcatac ataaaaatag aaatgtatat 300 

gaatgtaata tagattatat atttattatg tatgtaaaaa cagtatgtgc acatgataaa 360 

tgagcatatc tacgtctct 379 

<210> 184 
<211> 317 
<212> DNA 
<213> Homo sapiens 

<400> 184 

gacccacctg ccatgctgtg aggacaccca ggccacatag agagagtgag gccacatgta 60 

ggtgttacag ccagaagccc cactgaaaac caaacctgca accagcatca actgccaaac 12 0 

atgtactgaa gaggctgaga tgattccagc acttgtggat gactgcaacc acatgagaga 180 

cccagagcaa gagctaccta gctgagccca gttactccca gaatcatgag agaactatgt 240 

aattgattgn tattactata taagccactn ngtttncntn tgatatgtta tgcagcagta 300 

gacagctgga acaggag 317 

<210> 185 
<211> 378 
<212> DNA 
<213> Homo sapiens 

<400> 185 

gtgcagtgaa caaccacgac aggcttcaca tcatcctacc tggtcagaag ttgccaccat 60 

taggacaatt aattaaattc aacagtaaag atgctgccat agttaatgaa tcatgttttc 120 

cctggagctt tccacctatt caaaggacaa gtttcagagc ttggatgagg agcaactatc 180 

ttatgaacac agagacattt gtcagtttta aaggtcaaat tagatttttg ctcaggttcc 240 

caccaaaatg atagacttga aaatcaggat ttatcaaact atgttctaaa ttatttcaac 300 

atatcgagtg tattagtctg ttttcatgct gctgataaag acatacccga gactgggaat 360 

aaaaggagga ttaatttg 378 

<210> 186 

<211> 688 

<212> DNA 

<213> Homo sapiens 

<400> 186 

ggntcccctc tgttgnccan ggctggnagg cnnggggcgg gaaccttnnn taactggaac 60 

cctgggcntc nggggnnnaa ncctaatcng cggtgncntc gggcctggcc aaaggaagcn 120 

ggggaattaa caggtccggc gccgtcaccc aangccccgg ctaaaattat tttggcaatt 180 

ttttttggta agaagaacgg gggggttttt ngcgcattgg tttggcccaa gggcttgggn 240 
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cctacaaaaa antccctggg ccctcaaagg 
aacccaaaag gtggcttggg ggaatttaac 
cccggggccc aagcctggga ataagtnnct 
gcctgggaaa ccctcaaggg gggaaggggg 
cctggggatc ctggacaagg gggncctttg 
gaaagggggg caatgaancc tccgggctca 
gcctcnaaaa gggaancctc cccancctca 
gganggcnaa gganaccaat tgcccccc 

<210> 187 
<211> 404 
<212> DNA 
<213> Homo sapiens 

<400> 187 
gtgactgcct aatgttaaca aagatctgta 
ctctttccta atccttcaag tcatacctga 
gtccagcccc tgaaagacgc agacagtgca 
tcaaggtgat gtctcaagca aaatgaagtg 
gaaaactgga gtgttcgttc atccatcaaa 
ccagctcccc cagacaaagc agtgaacaga 
atcgggtggt gatggctggc agataaaaat 

<210> 188 
<211> 552 
<212> DNA 

<213> Homo sapiens 
<400> 188 

gcagaaggcc ccanaaggnc cgcaagaact ccccanaaag gccngcaatn nntccgncaa 60 

gaagggcccg gcngaacntc ccgcaagaag ggcccgcaag aactcccgca gaaagtccgc 120 

cacacangca aagggaaaga tgcctcccgc gtccaagccc ggcttganat gagcaggccc 180 

gangagccaa tggcgcaaaa gaagngnccc ggtntcccgg atcgggnant cctcataact 240 

ttncctttcn ttctggacca aggtaaagcc cacaagagnt atgggaaaaa agngcttggg 300 

gggaaaggaa ancnggtggc cggaagttcc ttcttcccaa ccaagggncc cactnaattt 360 

atngggagga aacccaaaaa ggcgtttttt ccttaaaaaa cctggaccgg gggaccaaaa 420 

tccgaanngn aacctggacc cacttgcagn accattggga cctttccccn taaacctttc 480 

aaaatctngg tgggaagaag aagggccctc aagaaggtcn ntccactccg cctattntca 540 

atttatcaag gg 552 

<210> 189 
<211> 317 
<212> DNA 
<213> Homo sapiens 

<400> 189 

acttgcaact tatgtttccc ttttaatcac aaagctgaag 
atcatgaagc aggaagaaaa aaaatcatcg acatttttga 
tttctgcaga ataaactaag gctaacaaaa aagacaaaaa 
aacctaatta tttggtggat ttttcaaaag gtggtcagct 
accaatgttt tctaggcaag cctagatggt caacttttga 
gatttgttta tgcatac 

<210> 190 
<211> 370 
<212> DNA 
<213> Homo sapiens 

<400> 190 

tgctgctttt agaccagtcg cacaccaggc cgaagaggtg agagggtgag gtgtttccca 60 

caagaacatc cacatcctca ggatggatgg aggagcaagg acgagaaccc ccaacccccg 120 

agacagtttc tggctccttc cttccaagaa gccctacaca tgatatccac gttgaagccc 180 



ccgaatccca acccccggct 
caagggccgg nggaagcccc 
ttaagtgaat caaanatgaa 
gccctnnacc cttctngggg 
gcttccattc accccaaggc 
acctggcccg ccttggaccc 
agccctcaaa ggaanaannc 



ttcgaaccct 3 00 

acccggccgc 3 60 

cctggngggg 420 

naaaacnnat 480 

ctcaaaagtg 540 

tnccctggaa 600 

taagggacnt 660 
688 



ggaatgatgg gaaggggcac 
agatccgcag tttttctgga 
gagagaagag cctacgtttt 
gtttgtggct gaaacaacct 
gaacaaacgc caacgtctga 
ttaaaggatg ggaggaagga 
atggaacgct tcac 



tggtacttnt 60 

gacaggtgaa 120 

tatatttttg 180 

ccacgggaaa 240 

gccaacgacc 300 

tacaatcaaa 3 60 
404 



aatagacaac tatacgacct 60 
ccatgcaaat gagcattttt 120 
caactgatca ttcgtatgaa 180 
aattatgtgg tatcatctgg 240 
gagagtttat aataaagttt 300 
317 
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tcatgcaaca agctactcat tcctcttctc aaaggaagtg ctgagtgtct ggcaagttgg 240 

aaagaatgag ggattcttct actgggttac ctggtcagct ccgaggagag ttaaaccagg 300 

aaaagtagtt caggctggta tacctccctg tttgtccttg agggcaactt aaaagcacta 360 

tttacacaag 370 

<210> 191 
<211> 427 
<212> DNA 
<213> Homo sapiens 

<400> 191 

catgccatgt ggacgtgacg cctggagata tcgcacccac cttataatca ggaggaagaa 60 

tgccacgtgt ggaggatggt gccacaggaa tctggaagag ctcgatcctg gacgacttgc 120 

tcaagcagct gcacattcct cctgccacct acttctggat attgtgttag gaaactggca 180 

tgagcataca catccattca gaggaggtga aagtggagtg actgatgcta gaatccccac 240 

cttctgagtc aacggtccag agaacaaggc caaaacagcc acaaatactt ttcaggcttc 300 

aggatcaaat tttttattct tgaatgatcc aaacacttta agaaaaataa agtttctaga 360 

ggaaatcaac aaaagtgggn nnannnnann nnnaannnan aaaannnnnn nnggggggcg 420 

ggggggc 427 

<210> 192 
<211> 453 
<212> DNA 
<213> Homo sapiens 

<400> 192 

ctttggtgtc tgcacagtcc cacacgagcc aagcccggct tgcagggtca agctgtcttt 60 

tcatagtggg aaaaagctga tgaaaatcct tcacacagag gtgttaagag cttaatgatg 120 

aacactcccg acctgagtta taatttcaca agaatttgaa ctttattttt ctgcggagag 180 

tcacgtgatt tgtcctgcgt gccaataaaa ctactgatgc cagctggcct gaagaactcc 240 

atgaagatct gactgactaa agaatgcagt ttccaatcct ggtgatttca tcccccttat 300 

cccaagcagt caataacttc tactttccag cctcttgtcc tccacgatcc ccttaaagac 360 

tctagcccaa aactccccag ggagatggat tcgaggattc ctctgttcgc tcactcagcc 420 

actctgcaat cattaaactc ttttctctgc tgc 453 

<210> 193 
<211> 453 
<212> DNA 

<213> Homo sapiens 
<400> 193 

tctgtgtcat gctgccttct gtagcaacaa cggctgntcc ctgnttntgt gccacatgcc 60 
aaactattca acatntgcac atactctcct agtcactctt aagggtgttt cataatgaag 120 
aaactgaggc cgtgaggact gaggggcaat gctgcagcaa tgtcaagttc attcggtgga 180 
ccacgtgcct tccatctcca aagacacagt ctgtgctcct taaatacctc ctgacaaact 240 
caatgtgcag aggcaagata gagcaagttt ctgctgcaaa ctcaccacca gtagtggatt 300 
ctaagcccan ctncctgcca atgattcttt gcagggncac agcttctgtg cctgttcacc 360 
tagggctggn tnaccacagg ganggancnt gattggggaa aagcattggc ngtnncagaa 420 
tggaaaangg gacctcaaaa ttttgtctta ggg 453 

<210> 194 
<211> 473 
<212> DNA 
<213> Homo sapiens 

<400> 194 

gcttttggca tctccattca ttccggaaca gccagtcagc cctctctgct gtgtcccaga 60 

gcaccaggaa gtgagtaaca gtcctagagt gagacatgga ggatacagcc aagtatcaga 120 

ggagtgctgg ctcgctgctg cttctacacg tcaccgtact gggggaatcc tatgtgaagc 180 

cgccccatgt cctgtctgcc tggatactca ccatgcagat agctctctgc attcagcagg 240 

gtctggctta ggccctctcc tgggggccgg agacccctct gttcttctcc agaccctgca 300 

gaattctgga gaggagagga aggtggaaca cacactttct tnctgctttt ctanggtgnt 3 60 
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ggggcatctc tcttcttctt ttaactacga acttcacagn ccaaccactt tctctttttt 
acaagcccct tggggtcctt caagaaccaa agtaaaaaaa agctttaaaa atg 



420 
473 



<210> 195 
<211> 127 
<212> DNA 
<213> Homo sapiens 

<400> 195 

ccattgacct ggatggacct aggacacaca ctaaaggaca catctggatt caccaaggag 60 

ctttttatat ctcacaaaat agcatgttgc taataagaag aataaaatga aaccaaggta 120 

caaaatg 127 

<210> 196 
<211> 311 
<212> DNA 
<213> Homo sapiens 

<400> 196 

agaaagaacc ttcaggnntn gggaggtggg ncttttcntn cntnaaaacn atgatncctt 60 

gggtganccg nnnggattgn cccacaancc ccgatggaaa cattcaanag gngaatgcct 120 

tgctcanaac cccctggcca ggcttaggag ggaaaaanta tgctttccaa ctntggcaag 180 

aaattgctgc atccanaggc tgcagaagcc ccgaggagca tgaacatgct ttggaagaat 240 

angcgctgcc ttgagtgaca tcctgaacca gacccttaca cacacanctt tcattggtgg 300 

cttttggggt t 311 

<210> 197 
<211> 497 
<212> DNA 
<213> Homo sapiens 

<400> 197 

caactgtgga agtcaaggcc agaaatcact cactatatca tctgatattc ctctgatcgt 60 

tatacctatt ctcagtgtta aggaaatgag accagttgaa acgtccacat taaaataaga 120 

agaaggagag aaggttttct aattgcagtt aatgtcatcg ttaaataaag aatgccataa 180 

aggaacgaga tcagcagtga ccttctgcac agtttccaaa gcctcgccaa cctacctccg 240 

tgtcctggtc tgacttatgg cagaaacaga agttcaaaga cctggctgat atgctccgtt 300 

aaaaaccctt ccacaacgca gttaacattt tctgntttct gactttcttt ttctaaagag 3 60 

atgcttaaag caaaaaangg ttcctgcccc aaaaatgaca ttaatatttc gtaaatcaag 42 0 

aactaagata atggtttngg ctgctacaga gaccgttacc cttatgcggt tatctnaaag 480 

cttttcgatt aaaacac 497 

<210> 198 
<211> 350 
<212> DNA 
<213> Homo sapiens 

<400> 198 

atctgaagag aagagaaacg tgagggaaga acaggcggtg gcagccggaa gagagtgggt 60 

ggaacagtcc ctgcaactct tcagagaaaa gaaaggggcg ctggcccagg cccaagaagt 120 

gtccctgggg gccgatgtcg gcaggaatcc ccgcatctcc acatgcggaa ctgagagaag 180 

tgcctggcag attcaatcat acagtgactc aaatgtcaca gcatgactat agagaaagaa 240 

taatagtgga agcatcccgg ccaattttca acagaagggc tcaggataag gaagcttaag 300 

aaaattgccg aagagaatga taatgacaat aataaaaaca aatagcttcc 350 

<210> 199 
<211> 275 
<212> DNA 
<213> Homo sapiens 

<400> 199 

caggtgaata aggtgggatt tgaaatcagc atggcagtgt ccagtggaag aagggagctg 60 
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aagtttcttg aggatgaata taaagctggg ggagttatca ttgagcctaa ctctctggtt 120 

tggaacccat aaaccctaat caatatacct cccaagttta caatagaggt gagtatattc 180 

taccttactc catttccatc ccaacttccc cactttgtaa actttcagaa ctgacttatg 240 

gaggtttata acagccagat atcaaaccca tagac 275 

<210> 200 
<211> 354 
<212> DNA 
<213> Homo sapiens 

<400> 200 

agaaagagga aaggaccagg agtggcgacc ggcaaaccac agcttgtgtg ggaaggaaat 60 

ttgacatgtg atgcaagcgg accgtttgtg taaactgctg ggagattaac aacaactgtg 120 

agtggaattg ctgagtcatg tggcaaacta ccagttctgt tgaacctcag ggccatcatt 180 

ctgttcatgt cagctcgttg tagaaccaca tcgatgaaga ccaagatggt aaagatgaaa 240 

aattgtagct aacatttact gcacatttac tacaagccaa gcattgcact atgaagttta 300 

agtgcattat tcattaaccc cttcaataaa atttgtaatt ttcacttcag aagc 354 

<210> 201 
<211> 310 
<212> DNA 
<213> Homo sapiens 

<400> 201 

gttggctgat tgtggaggct aaagcaactc taccttgcca gcttatccac catgtggact 60 

tctaattaat ctcagttgcc ggaatgcctc taagatttct acgttatcta ctgtgaagag 120 

caagtaatta ctgcaaatcc tgcccttggg tcaaaacaac cttgatgaca tattccttct 180 

gaagcacata tactctttcc ctaggtatat aagccttggg tctgggggct aacggtgcag 240 

ggatccatca tctcacagcc acccaagaca tggcttttgt tcaaaaatcc ctattaaatg 300 

tttcattctg 310 

<210> 202 
<211> 446 
<212> DNA 
<213> Homo sapiens 

<400> 202 

gtggttacaa ctgtggccgg ccactgtcct aacaagtcag aagagagatt ctttgccaaa 60 

atcttcaggg gaaacgacac gagtaccctt tgcttttcct caacgaactt cccttctact 120 

tagggtttta gggcatttgt acaaatgatt tgttccttgg gtctgaatct tggggatgtt 180 

tatcattttc gttgctttca gaaaatagtc tgcatttcct tctattacct ggaccatttt 240 

cctggctttt taaaaaaaaa ttattattca aatggaaaag cggcgagccc agaatgagcc 300 

gacgaattga gctcttcctt ctctcgaaca cgggggcacc tctacccgct acagacttga 3 60 

agattttact cacttccttt catcccctcg ctcgggtttg gagggtaggg gcatgaagtg 42 0 

gntgaatcta aactggcaga aaaccc 446 

<210> 203 

<211> 88 

<212> DNA 

<213> Homo sapiens 

<400> 203 

gttcatatca tggatcccat tttatagatg ggaacactga ggcctgagtt tacacgagaa 60 
tttgctgaag aggagaagga aaaaaaaa 88 

<210> 204 
<211> 211 
<212> DNA 
<213> Homo sapiens 

<400> 204 

ggctttttca ctcattccct angcatgtgg gacctcnaag atgccgaatc agctaaacgg 60 
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gaggnggctt gagtangatt tgctgccagc taaagcgtga gatgctattg catgtgcaag 
gcaaggcttt cttcancggc atcatcttnc aaaatagccc agngagcatg cctttctcct 
gaaaaataaa aaatagttgg tgtttactgc g 



120 
180 
211 



<210> 205 
<211> 245 
<212> DNA 

<213> Homo sapiens 
<400> 205 

agttccccaa ggacagaggt cagggaataa gagctgagtg agacctccca aagcagatca 6 0 

caaagagaag gggacactgc accatggagg tgacacaggc cagtggccac ggtgctggac 12 0 

ctggggctga gaggacccac atgtatatcc tggccgattt aggtatctta gactttctgt 180 
gcctcacttt ccttatctgt gaaatcagca ttctgatcat gactaaataa aaattgctgc 240 
cattg 245 

<210> 206 
<211> 325 
<212> DNA 

<213> Homo sapiens 
<400> 206 

gggtatcctc accttgtata atcctggaat cacacttctc tccgcgtaca tgctggcaga 60 
gctcattctc tccacttggg aaggaggcta caacttacag tgtcaagatc ttaccagcgc 120 
aggggaagct gacatccgga ggaccaactg aataaaccac agcacatcca cgtagcggat 180 
gcctctacca agtggagtga ggaagagctc tataccgcta cagaattgtn tctgggatat 240 
agttacatga acaaaagcaa cttgcagacc gtgtttatag gatagcaccc tttgtgcaat 300 
aaatgatatg aatgcaaaaa aaaaa 325 

<210> 207 
<211> 232 
•<212> DNA 
<213> Homo sapiens 



<210> 208 
<211> 159 
<212> DNA 

<213> Homo sapiens 
<400> 208 

ccttgaatat gagcatgctg catgctgcag cagtatatag tgatcaaagg caacaagcca 60 
aggatgatgg aagaacaaga gagaagcagg ctggttcttt gacattggac agccagagtc 120 
ccagccctgg atggcctgtt ccagacatct tgtcaagtg 159 

<210> 209 
<211> 329 
<212> DNA 

<213> Homo sapiens 
<400> 209 

gggtgcgatt tactggtgat gagctctggg accttcaata ctaccagaag attgaggaca 60 
tatcagggga gacctgttgc ctcacttttg tcccaatgta tgacctgttt ccacagagaa 120 
acatgcagga gaaattgcac agatagaaga actgaattaa caatctccaa gactgctgag 180 
tggttttgat ctgccttgct tactttttca gccgctttat atgctgaaat gtttccagtg 240 
caaccagaag tttcaagtgt aaaattctgt ctttcctctt ctgttatttt aagcttttaa 3 00 

gacaccatac ataanagcaa ataaatgac 329 



<400> 207 



aactgtctac tggctgcaga taagagaatc tctttatggg ggaactgaaa acagaagaaa 
aatcaaggga taatggcatt tgagggttcc tcaatgaccg cccagccaca tcacaccgga 
gtggagcccc aacctgagag gctcttaccc agagcttcca gtcggcattt cagtggatca 
cttttaaaaa taaatggtga tggggtgatg gaaatgctac ccccaaaata eg 



60 
120 
180 
232 
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<210> 210 
<211> 133 
<212> DNA 
<213> Homo 



sapiens 



<400> 210 

agatggggtt ttgccttgnt acccangctg gataactact cttgatgaca taaaatctac 60 

tgnnngcagn aaagacagan agcatncacc ctaatacctt agttatgaan actacagaat 120 

cagtagaaga aca 133 



<210> 211 
<211> 270 
<212> DNA 
<213> Homo sapiens 



<400> 211 

gttctgcatg ctgataaaat gatcaacacc tgctggtctg aagggctcag caagaaactg 60 

actcatggga gaatgcactt tccatattct aatgacttca tcccccttac cctgaccaaa 120 

cgataacccc aattttctaa ccccttgccc tctccaatcc cctgaaagat ccttgcccag 180 

aacccctcaa tgaaatgaat ttgagtctcg agaattcctc ctgtttcctc attcagtcat 240 

cttgcaatta ttaaacaact tgtctgctgc 270 



<210> 212 
<211> 355 
<212> DNA 
<213> Homo 



sapiens 



<400> 212 
gtggagagaa cagcatgtgt gaaggcccag 
cttggtcagg ggaggatggt cggccaggac 
gctggccttg gtgggacttg ctcagggact 
nggcnttgtn attccgaaga ncnnggtctn 
cccaccccag gaggatcacc catatncaac 
aagggggaaa cgccaagccc attacattag 

<210> 213 
<211> 397 
<212> DNA 
<213> Homo sapiens 

<400> 213 
ctgcttggtg ctgcggtgtg ccctatcctg 
catacatcca cagccccagt cggctgtatc 
gcagcccacg ccaagggcca agagatgtga 
ggaagaggtc acaagcccag gaatgccagc 
atgagttttc ctctgaagct gccagaagga 
cagtaaaatt gattttgaac ttccaaaaaa 
acttaaccag gnngaacttg ttnaaaaggg 

<210> 214 
<211> 141 
<212> DNA 

<213> Homo sapiens 
<400> 214 

gtgttgagtg ggtccctttg gctggctgct ctatgaatgc tgtccttcgt gcataagaac 60 
tagtctaagc tcccaaagaa ctggatgcta atccctgtcc tgatactaac tcaccctggg 120 
acattaaaca ggtcaaaaaa c 141 



anccggcccc cggatctttt 
acatgcatgg ccccctggag 
cactgctggc cttggggagn 
ncncctgcaa ntgccgttnn 
nccnggagca gcntcagcca 
gacttttccc tgccatcact 



canaatgcat 60 

tcgtgcagct 120 

acanaactca 180 

cagaatngnn 240 

cnctnnaaac 300 

gggct 3 55 



gctgcatttc ttcattccct 
catgaagagc tgaatggaac 
aggtagaagc aagaagttag 
agccactaaa agctgaaaaa 
acaagcccag ccaatgcctt 
aaaaaggncn gngnggccan 
ggggggc 



cccctgccca 60 

aggatgactg 120 

aatgacctga 180 

aggcaaggaa 240 

gaccctagcc 300 

ttnagntngg 360 
397 



<210> 215 
<211> 96 
<212> DNA 
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<213> Homo sapiens 



<400> 215 

ttcctcctcc tgccatggtt tgactgagct gaacaaaccg gaaacttctc taggaaccgg 60 
gctatactat acatgtaatt aaaagttaat tatctt 96 



<210> 216 
<211> 305 
<212> DNA 
<213> Homo sapiens 



<400> 216 

aaagaaaaac tacatggaat gaggaaatag accactcctg ccttcaaaat cctcttcgtg 60 

aggtttatag aattcctaag aactcaggaa agacatcagc agagagcaat gatcgtcata 120 

gccagctcca cacagaatgc acccacccag ctacttgtcg aattacaacc tgatgatgga 180 

tccaccagaa actaagaatg gaaaggttat aaagaaatca cagcattcat cttctggaag 240 

aaaaagacta tttcttagaa agtaaaataa atgaataaaa gcacttaata aggagcataa 3 00 

cgcgc 305 



<210> 217 
<211> 427 
<212> DNA 
<213> Homo 



sapiens 



<400> 217 

ctttctctaa ggaagtgaca tataagctga gcctgaaaga tgaagaggag cagattgtat 60 

gcagagcaga gggaagagca agctgatgga ggtgactaat cagagggcct gatggtcaag 120 

tgctcaaggt ggagttaaag gaaaccctgc tttcttgaca tcaccagctg ctcagaagcc 180 

ttcagcaggc afccctagacc ttctccttct ctaagggatg ggcctcacct actttcttca 240 

gctgagacct ggcacagacc cttggagctt ctaaggaccc cattgtagcc ttggggtgga 300 

ggcccatggc accactgccc tctccctggg ataaaggtcc tggggccact tctcaaggct 360 

gggncccttt nttaagaagg aaatgntttt tcccaaataa cctnctcttc ttcctttttc 420 

ttcaccc 427 



<210> 218 
<211> 438 
<212> DNA 
<213> Homo 



sapiens 



<400> 218 

gacgtgataa cgagtcatac tgcggtggat cggcatgcac cctgtccccc ttcttacctc 60 

ccagaattac ctcagtatca tagcgtaggt gctttggaga aaactgactc ctcctagcaa 120 

taagtcttca gttgctttaa gctttaagca cattctttca gtcctctgat cactgtcatt 180 

tgtccagggg tgggcatgga ctttagtggt accaaaaaaa atctcgcatt cctatttgaa 240 

atgctgagac agaagtacag gctctcactt tctctgcagt tggcagagag ggaatgtggg 300 

ctcgattgct tctggcaaac attgtgcaag tcatgttggg aaaggggact tgaaatgaag 360 

cgaagattcc agaaaacaga acaaaccaaa agaaatggtg accactataa ctggcaactg 420 

tggagcctgc cctatctt 438 



<210> 219 
<211> 424 
<212> DNA 
<213> Homo sapie: 



<400> 219 

gaacactatg aaaagattgc aaaaccaaat catgagaagg ttagattcct actgaaatga 60 

aagatattca tggtatttgg aaactcttat aagcaagaag tccgaaaagt tcaagatact 120 

tctgtagaat ggtttaattt aaaaagtggc tgctatcctg gatggggtta agaagctgct 180 

ggtactctgc tctggatctc cttcttccct gttgttctcc tcccaacaaa taactctcat 240 

cttcaagtct accaaaagcg gctgacctta gtagcataac ctctaaacca aactcaactc 300 

ttaccttctc cataaagctg ccagaaattg ctcctgccga gagtaattta cctcttacac 3 60 

accactgtta tttcactgtg tgggactgna ttcccaanta aattgagaat gtctaataga 420 
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tttt 



<210> 220 
<211> 318 
<212> DNA 
<213> Homo sapiens 

<400> 220 

taaccggatc tcctcgaatt ccgcgcgcac gaagactcag gggagggggc cgagtggact 60 

tcaccccgca tgagacgtct ggcaaaataa gaaggctctc gcaaaaccta acaaccaaat 120 

atgcaaagcc ccaaatgaca accaccacct cctcgaacct cagaggtctg ggggcgtccg 180 

gctggaactg gggtttaaaa aaagaaaatg tttacaaagt ataacaagat gtttgatggg 240 

tggaaaaatg tatccacgag ttacatcccc ccgtttcctt gcaaagcccc gctggtcttc 300 

ctctcctttt cttctgcc 318 

<210> 221 
<211> 227 
<212> DNA 
<213> Homo sapiens 

<400> 221 

ccttcagact tggcctgaaa cattggtcct ccttgggttg tgagcctgca ggtcctcaga 60 

ctgaaactat ccatcagctc tcctggttct caggctcctg gattcaagct ggaagtacac 120 

atcaggtctc ctgggtcctc agcttgatga ctcgagatct tgggaattct cggcctctat 180 

aactgtgtgc cccaattccc tataataaat ctttgtcttt ctctccc 227 

<210> 222 
<211> 462 
<212> DNA 
<213> Homo sapiens 

<400> 222 

gtcgaaatcc ttccccgctg atataaatat ttgagttggg gagcagagct tcagggacca 60 

tgaagaaaat gctgctctgg ggacactaat tgaactttca tctagcaggt cctgtgccct 120 

acctactcaa gaacaagttc tgtttgatga agaagttaca cagctgccaa gttccctcat 180 

tctactacct atctaccccc aaattcagga atgtctccat atgttgacta tgcngacttt 240 

ttcagtgtcc tagtggaacc acagcttaaa aaatgggaaa tggaggcagt cccatatggc 300 

agagtctccg atgtggaatt aggcatcgtt ctccaaaagc cagcctgcag ccctttggag 360 

agcttactaa actataaatt gtcaactgta ttacatgata aagcagatgt gtccatacag 420 

taactctttt gctaataaat gaggnctaaa ttccaaaaat ag 462 

<210> 223 
<211> 465 
<212> DNA 
<213> Homo sapiens 

<400> 223 

tgttaaattc tcctgagtga atcacaagtc caaggtggct gaatgcactt gccagtctat 60 

tgctattgaa gcaccttaat gacataaaga agaagaaacc aatgaacatt gttatatatt 120 

tcattttaaa ctgatgtaga cattttgagg aaatctgcat tttgaaccag gttaactgtg 180 

gaatgccctt ggccaagagg aggggtccat ttgatgattg gatggcctta gaatttattt 240 

ttggttaata gtgccacaca gctaaatcca agagagtgtc ttagaaaata aactctggaa 300 

acatatttga gaaactaata agaatgatta actgtagagg gaagtgtcag gcctctgagc 360 

ccaagccaag ccatcgcatc ccctgtgacc tgcactatat gcccggatgg nctgaactta 420 

ctnaagaatn cccaaaagaa agnggatttt tgcccttgcc ccccc 465 

<210> 224 
<211> 184 
<212> DNA 
<213> Homo sapiens 

<400> 224 
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accattagaa tgtgacctct gtgaagacaa cagaaatgga ggaggcgatc catgggcatc 60 

ttctgaagct gttttggtta actttgattt ggaagtcctg gttccaggtt ctcctgtttc 120 

ctgggaccag ctccagaagt tcattatttt cataaataat aaatgaatgc atactaggga 180 

ctgg 184 

<210> 225 
<211> 124 
<212> DNA 

<213> Homo sapiens 



<400> 225 

tcttaacctt ttgagctccg ttcagcctgg ttaagnccaa gctgaattgg ccnattcctt 60 

tngccntttt accctggaag aaatactcat aagccacctt tgttatttac ccccaatctt 120 

caca 124 



<210> 226 
<211> 374 
<212> DNA 
<213> Homo 



sapiens 



<400> 226 

atgaagatca ttgagattag agaagaaaat gggatctggc caaggacata caactaagaa 60 

atggcggtgc cacagatgga gaaactgaca ctcagacagg ccaactgatc tgcccacatc 120 

aacgagctaa aaaaatggca aggccaggat ttggccctag gcctgcttaa ctctgaagac 180 

catgtgccca gtctcctgcc aggccattta catcctcagg aggattgctg cagccccagg 240 

acaggcgatt gccttttacc accctcctgc cagaccacac tgctgctgtc cctgctcctg 3 00 

taccccactt ttgctgggtt gaaaagggtg aaaggggtac cccactgctt gttgtacccc 360 

accccaaatt ttgc 374 



<210> 227 
<211> 318 
<212> DNA 
<213> Homo 



sapiens 



<400> 227 

atgcaatgaa attaacctct ccttccaaga acagcatgca ggcagctagc tggaaagact 60 

cacacttgag tgaatagcga cagctcgccc cttctgcgct ttgacgctgc tgtctctact 120 

ggccacttgg tctaccagtc agttgtgccc tgtatgtacc cagccatggc tgggaagact 180 

cacaaccaca agattgccta tcagtaggaa atacaggaaa ttacaggatg ggtatatgag 240 

acatatgtgg tggatataaa gctcaatagt agtgatacaa gtgtcatatt cagaaaataa 300 

tataaacttt cttgctat 318 



<210> 228 
<211> 502 
<212> DNA 
<213> Homo 



sapiens 



<400> 228 

gcccagaggg gactgtggac ttggtgccag aaaagaaaat gaaaagcaaa agttgaatct 60 

ctgcggacca ttctctggat gctgaatgtc ccactattac atctcggcat gacatttcat 120 

ggccagcagg ggaggaggcc cagtcctgaa agctgaacaa acgcccggca cacaggcctg 180 

cctgcgccct cgtagtctct ctggacttat gaataaaaga tggaggtttt gtctctgttg 240 

tttccctggt accctgtaag aataacaact tgttgctttt tgacatttta acttactttg 300 

aaaaatgacc aatattaact ttacatgtct tggcccttaa atctggagtg gggtaaaatg 3 60 

aaagaaacaa aagccatgta attangnaga agataataat tcaaggtaaa ctaatgaact 420 

gnctgnaccg actttattaa aanatggngg gacatgccat cccnaactaa aagnttaaac 480 

ctgacttgga ggaaccttgg gc 502 



<210> 229 
<211> 228 
<212> DNA 
<213> Homo 



sapiens 



50 



5 



<400> 229 



gagacactnc ggaaggcnca gaagatagaa cacagagggc 
actgaaatcg gagtgatgca gncacanncc aaggaatgcc 
agangcanga natagactct cttctatagc ctgtggagct 
gatttctgcc ctccagaacc atgacagaat aaagttctgt 



naggccatgt gaanacagat 
tggagccacc aaaagntggn 
ctggtaatac cttgnttttg 
cttaagcc 



60 
120 
180 
228 



<210> 230 
<211> 395 
<212> DNA 
<213> Homo sapiens 

<400> 230 

ctccttcntc aaaaagtgga atccaagttg tctacccttc acaactgaac tggctacatg 60 

acttgctttg ttcgaactgg ctgcatgact tgctttgttc aaccaaatgc tgcagaagtg 120 

acggtgcaac acttccaaac ttaagaggct ttgcatgctt ccatccctgc tcttgatttt 180 

gagccacccc tgtcacacca gtcaataagc tggctagctg aaaaacgtat aagtgagcct 240 

gtgccaggcc agccagtgtt agctgacttt tcacctaact gcagacacat gtgcaaaccc 300 

aacccaaata agccaagcct gacccagctc aacagaacta tcaggtgacc tatagacata 360 

cgaacaataa taataaaaca aaacctaagc cactc 395 

<210> 231 
<211> 178 
<212> DNA 

<213> Homo sapiens 
<400> 231 

gtttcccaaa ggatccaaaa aactgagagg gaagagattt ggggaagatg tcacttttcc 60 
tcatctgact ttgccttgga gtcagatggg agaatgactc ctggagaaca cttagccttt 120 
tccagctttc cccaanaaag gctggcccag ggaggcttct ataaaccttc tccctatg 178 

<210> 232 
<211> 299 
<212> DNA 

<213> Homo sapiens 
<400> 232 

ctcaccagag acctcaaatc cttacctgga ggtcaaaaaa cttgctgtag cgccggtaaa 60 
tggcctcngt ggagccgntg gaccacgtga cccggatgat gtacacctgc gggagcaaca 12 0 

aaangagatg ggtgttaaca ccagaaggtg gtctcccaat ctctgggacc cagggggagc 180 
ncaagactca nagtcanaaa gacgtgggtt tcaaccttag ctctgccaat gactggctgg 240 
acaagttgct tgctgtaagc ctcatctccc tcctcaataa aatgagtgta ataaccccc 299 

<210> 233 
<211> 137 
<212> DNA 
<213> Homo sapiens 

<400> 233 

gngaggatgc naaganaaaa ggtggctgnc tgnaaccagg gagggagaan ccttcccagg 60 

gaccaatcta gcttgaactt ttgactttgg acttcaacct ccagtattgn aaagaaataa 120 

atatgttttc aaaagtc 13 7 

<210> 234 
<211> 216 
<212> DNA 

<213> Homo sapiens 
<400> 234 

agatatggtc tcactatgtt caagtctaag actcaaactc caggactcaa accatcctcc 60 
cacctcattc tctcaagtag ctgagactac agggatcgaa agatgaagaa ctcttgttga 120 
agctcataac tccctaatta cttattatta acagtgaaaa tctgattttc aaagttgttt 180 
aatggtcatg caataaagca atgtaagacg actgcc 216 
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<210> 235 
<211> 281 
<212> DNA 

<213> Homo sapiens 

<400> 235 
gtctttggac ccagattgga actataccat 
gactgcagat cttgggactt ctcagcctcc 
gaaggtaaaa gcatcaaaat caccacagca 
gagaacagtc ttgttttcct tgcgttggca 
cagagcctca ttataggttt agcaattaca 

<210> 236 
<211> 491 
<212> DNA 
<213> Homo sapiens 

<400> 236 

cttgctagaa gagcactgga gatagagtcg gatacgcttt aaaggacaag ggaaaacagb 60 

tcccagtgga tggtacacac atggcaaaag gccaagagta gaagcaccgt cattaggaaa 120 

aggaatcagc caaggtccca ggcaagaaga ggtgaggcaa atggaggctc tgaggaaagt 180 

ggctccaaag cctacatgat ggaagataac tctggaagag aaagagatga ccgttcctaa 240 

gcttgtatag caaaacttga gagaaggtaa cgaagatgtg acatctgaac tcagagaaat 300 

ataacttcta tagaaaagaa acaaggcctt gcagctctat aaggaacagt aaataaatca 360 

agtatgcaca caagaagtaa aaaaatatat ccnagtagaa aggaagcttt tcattgaaat 420 

gnccccagaa ctcatgctct tgganggccg ggatngcaaa atcaagnntt tttttaaaaa 480 

ctcctacccg g ^g-j. 

<210> 237 
<211> 199 
<212> DNA 
<213> Homo sapiens 

<400> 237 

aggataaaaa agaagtaaga aaatagagtc tctgaatata gatctttcaa ctgaaaaact 60 
gggctgtgaa gcttttggac tcgaagtaca gcctttcctg agtctccagc gcactggcct 120 
ccccccatca gattttggac tctccaagct tccacaagca caggagccaa ttccttaaaa 180 
taaatctgtt tctatatcc 

<210> 238 
<211> 282 
<212> DNA 
<213> Homo sapiens 

<400> 238 

cccccaagga ctgggatcaa tattggaaac ctgtgcttta 
gctatgctgt gtgacctcag gactgggccg actgggagca 
aactggagtg ccttggggag gaaggaggag agcacagtct 
agcaaataca agtggtcatg caggaagaag agtgctggtt 
gatgtacggg gggtgtgctt tgttcaatat gacaacacta 

<210> 239 
<211> 206 
<212> DNA 
<213> Homo sapiens 

<400> 239 
attgagcacc tgagagtctc aagtaacaca 
cgtacattgt gacttgttgc tctcaccatc 
aaagaagtgt tgtctactgc tgtgaaagtc 
cagccttttc aagaaatttt gtctag 



tggctctcct gggtttcaag 
ataattatgg gtgagaagca 
acaaagattt ctcaggaaat 
ggtgactcac tgcatagata 
ttttaaaaat t 



cttgcttgct 60 

ggagctcaga 120 

tataaatgct 180 

tgatcatctt 240 



gttcttccac ctctgctgct 60 

ccatgtggag aacagagaca 120 

ctgagtcagc catgaggcag 180 

ctgcggggtc ctaagaggga 240 

cc 282 



cctggtttgg ctgctttgct gaagacactc 60 
aacaggaatt gggctgtgca agcaattctg 120 
atcaacttta tcagacccca gtcctgaccc 180 
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<210> 240 
<211> 472 
<212> DNA 
<213> Homo sapiens 

<400> 240 

cacttggcac tgtacnaaac accttcatat ataccctgtc accctgactg agcaggatcg 60 

ctcagttcca ttttacagga tgaggtgaag acttttcaaa gccagagctc taccctgata 120 

gcacaccgtc aggatgttca ggaagagcct catgggttat tacagctcag gatgcatcca 180 

gacactgtct ccatggcctg cggagctgct ctctgaggac tcacttcact gcccctcatt 240 

tcccaggctc atggagatat actacctgtc acctctgggc ctggagggca gatggaggta 3 00 

agatgcaaag gaagactgcg tcgtcaaagc agatggaagc attccctaac acctgggcca 360 

tcctgggtcc taacttaatt actaaagaat aagggagatt tcaaagnaaa atgnncagac 420 

atttgnttat ttgaacataa aactgggggc ccnccaccag tattttggta ac 472 

<210> 241 
<211> 283 
<212> DNA 
<213> Homo sapiens 

<400> 241 

ccttgcaaat angtgatttc ctgccagtcc ctgcctctgt gaccaacctt gattgttcaa 60 

agtatagctc tgcaagcagt ggctacggac agtttccaac atgcaagttc atctccgacc 120 

ccacttcatc attcctcctg cccccagcac tcctggatgc tatgctgaat tgttttggta 180 

cctttggttt gtgagccttc ttaaaccttt ctttcttcta ctttattatt atcattgtat 240 

tataaaagca atagatgctc attactttaa aaaatgtaaa age 2 83 

<210> 242 
<211> 193 
<212> DNA 
<213> Homo sapiens 

<400> 242 

gcactgtctt cataagtcca caggtctcaa actccagcat ctcagaatga aaggattcac 60 

aagtgctcac aagaggcttg getgecaggg gaagctccga cctgaagatt tgaactaatg 120 

agggactata aaggecaaga ccttgttctt gecattttag agattcagaa tataatctac 180 

aaagttagag att 193 

<210> 243 
<211> 501 
<212> DNA 
<213> Homo sapiens 

<400> 243 

ectgeagagg tcanggagag ageccgatgg eggtcttaat gaagaggaag gaggaaagga 60 

cgcagctttt tttacccccc ggcttaattt actcegtatt eggcttaact tactccctat 120 

tctaccctcc ggtcttcaag ttcccttaag ctcgttggcc tgttacccag taaaactaca 180 

aggaaatggt ctgtgtggtg aattttgaag ctgtccacag tacagatact ccagtgtctg 240 

cccttccaga aaagagctgg acctaagggg tcctcctgtc tcacgtgcag actcccaggg 3 00 

egggattaaa aaggcaaaaa tccnnngttt cntngcaaat cennggnant nngggnnnga 3 60 

nntnntnntg cenenntttg gganggaang aancanaatt aatttngggg ctntaaaggg 420 

tttatttata aangggcttn gggnttctat tttattgggg aanaaatnee ggganttaaa 480 

aatntaaaga cccccttcca a 501 

<210> 244 
<211> 327 
<212> DNA 
<213> Homo sapiens 

<400> 244 

gttcttccta acaagaagct acgaagttct tattcagaaa aacggaacac gacatcacac 60 
ccacgtgaaa aaaacgcttt taagaggeca agtcactttc acctcccacc aacttgecaa 120 
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aggctgaaag caggcggaca cgcccccaag cgctcttctc cgatttcatt ggttgccccg 180 

gcctgctcct cattaggtct ctctcactgg tcagcaatgc cgctttcaca gccaattctc 240 

agaaccaatc atctccaact attgccccgc ctctccacca cgtgagtggc ataggtgcca 3 00 

accaataaaa aaagaaaata aggatgt 3 27 

<210> 245 
<211> 100 
<212> DNA 
<213> Homo sapiens 

<400> 245 

gcangggcnt ccngnggttc aagggtacaa taanctgcga ncgtgccnct ganttctacc 60 
tgggatgaca gagtgggacc ctgtgccaca aagagagacc 100 

<210> 246 

<211> 505 

<212> DNA 

<213> Homo sapiens 

<400> 246 

aaggctgtct cctgcgagga ccagaagttg agccaaggca cgtggaactt acaatagcag 60 

atggtaagaa ccagggcaga aggagaactc ctgaagcctc cgaaggaagg aaatcattac 120 

agggccctac agaagtaggt catgtgctac agctgctcat agtttaagag gaagaaacat 180 

gggatctcaa acctggaaca cgactctttc aaaatgcctg tgagcaaccc aagaaaaaca 240 

tcctcctgag gcttatctaa taaccatgat ctctaatcgt ctcaatgtgt gctcatgttt 3 00 

ccttaagaag tttgcaccca cttctcagag ctaacgagat gccgaaacag aacacagaaa 3 60 

aaagtaatga aggagattta ataagntgng ntaaagctna tatgggccat taaggggcng 420 

gcttttttta aaacaanggg gnggaaccgt tcccctnttt tttggnggaa aagnnttttc 4 80 

nggggcangg acctggaaac cattc 505 

<210> 247 
<211> 139 
<212> DNA 
<213> Homo sapiens 

<400> 247 

ataaaatctc ctggcagaga aaatggacag tcgttccata ccatatgtct tctcagcttc 60 

aaaatcaaca acaacaacaa caacaaaaaa ccccaaaact tccatcatct gcagaagtca 12 0 

aataaaactt tcaaacttg 13 9 

<210> 248 
<211> 261 
<212> DNA 
<213> Homo sapiens 

<400> 248 

ttgtaaatta tgctcatgaa aagagacccc agcatctttc aaactgangg ttaaccttat 60 

tatcaggata atcaccaatt cacaggaagt tgcaaggatg gtatggagag cttccattta 120 

ttcctcggtt ttccccaatg attacacctc acataactgt acctcaggaa actgaagctg 180 

gtacagtgtg tgtgtatagt tccatgccat ttcgtcttaa gtgtagatct ccaatcaaat 240 

aaagaaatat cctgtcacca c 261 

<210> 249 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<400> 249 

gtggggtctt tcagtatgta caaacataca tgattcagga taaaagatgg atcgtacccg 60 

ttctcaccac agaaaagtaa ccggagactc ttctaagaaa tcgagaaaag aacgcccttt 120 

ctcctgccct cctgtctaaa gcgcaacata ataatcgaat ctcccaagct tcttagggtg 180 

ctgagtgttt taatccacca gccctcttca actagttaat aaatcctttc cagaccgaga 240 
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g 



<210> 250 
<211> 505 
<212> DNA 

<213> Homo sapiens 
<400> 250 

gnaanctgnt agnncatgcc ngacaccttn tctccatgcc tgcnctttct gttccaagcc 60 
atntggtgga agcaatccaa ttgcctgcag aatcatccga aagcatcact gggaagaagc 120 
tggtggaact aagaagcaat tctttagcct gacagccagt ctgtttttag tatttctaaa 180 
catgaaatca tctcaggaga agccaagggc tgtcgaggtg atttgcctga ggtcctacaa 240 
ctcatcactg actgtgtttg gaggaaggaa gtaattaact ataaatgtga ttataagggt 300 
ggggccttaa tctgatagga ccagtgtcct tataagaaca ggaagtgtgt gccgttcact 3 60 

gaggaaaagc catgcaagaa cacaaagaaa angcggctgt cttgcaacct ngaagaaaan 420 
ctttgcctaa aactaatcct gccgggcatn ttaatcttgg naattccagc ctccaaacag 480 
nganaaataa aggctggtgg ttatg 505 

<210> 251 

<211> 90 

<212> DNA 

<213> Homo sapiens 

<400> 251 

agaaacaaat acatcaacgg agacaacttt ggaaacaatg gaaacaaaga accaaaaatg 60 
ggcctgcaca taaataaaaa ctccatatac 90 

<210> 252 
<211> 589 
<212> DNA 
<213> Homo sapiens 

<400> 252 

aagaaggggg tttccgccat ggttngccca ggctggtctc aagctcctga actcaagnga 60 

tcttcccncc taagcctccc aaaagngctg gggattacag gcatgagcca cgactcccag 120 

cctgaaatat annattttaa tcttcagctt gcattttgtt ctaaacaact tgttttcaaa 180 

taagaaccgg gcagaaccaa gtttaagcca ccatttgttn ggaggccaga atcaatttta 240 

ttgggtggtg gttcaaaatn gggaactggn actaagcctt ccttcttccc ctccatcctt 300 

cctagcccat tgnngcangg gggaaatttt tctcnttttt tggnnggggg taaaacaact 360 

tctttccctc attctgggaa ttngcccttc aacctaattg ttggacaaac cgaaaaaaat 420 

ttcaaaggcc ccccaaaaaa taagcaaggc aaggcttacc attaatncct tttggcatgg 480 

aacaangggg gaaaattttt ttttggcctt aaanggnttn gggggcctag ccaccttgaa 540 

aaaacaanna nggcccgggt tnacctttcc gaatcntggg gggcttcca 589 

<210> 253 
<211> 498 
<212> DNA 

<213> Homo sapiens 
<400> 253 

gttccaggcc atcaagctac aaanggactt accaatggtg ccttnaaaag agctcaacgt 60 
gcggnttntn ttggngacat cacgggnccn ananaaaatg gnttaattta tgtaacaaat 120 
cccctctgga ggacaccana actgnngggc cccttntttg ccctnatccg cngaaagnag 180 
cccgaatgac cactncccag gtnccaacag cananggggg ggccnntcna aaaacnagga 240 
ctgagaggag ggaccccccg gctttctggg tcctgcnggg gctcacaaaa gttgtgaaan 300 
tcatttattt tcttgcntca agacnttctt ntgtgctggg gngaanaaaa attgaaacat 360 
atgctttaaa aaattctaac aaccacggag ttgngcattg tgttttnttn ccccaagaaa 420 
agcttttaac agnggaaaaa tttgnttnta agcttncctg ggggctctnt tcctggggtn 480 
cctttccttt tccctgaa 4 98 

<210> 254 
<211> 303 
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<212> DNA 

<213> Homo sapiens 



<400> 254 
ggccttcatg gaaactgctc tggtgtcaca 
gtacaagctc gtctgaaaag agttggcttg 
tcacaatgaa ataatcattt tattttgtcc 
gaggaaaaaa caaacaaaca aacaaacatt 
aattattttt ggtttattgt gttatctaca 
ccg 



gaaatatatc caaggatgga gtgtgtacgt 
caaatgggag aagctgtcca agaagtattc 
ataccgacaa acaaccagtc aattcagctg 
ttattttcca aatttgtaat gagttcgctt 
tagttgaatc ttaaatctga attttcataa 



<210> 255 
<211> 441 
<212> DNA 
<213> Homo sapiens 



<400> 255 
caggatggcc tagatttcct tacggcatcg 
tcngccaagc ctcttaatgg caaggccctg 
tattggttaa agcctgtcac aaagcctgtc 
ctcctgatag aaaaaagctg cacatgcata 
gaagagtatc tgccccatta agccatggga 
ctggattgta gaaactgatc tatagacatg 
catctgntca aactggcatc actcacacat 
agcctcaata agaattcatg g 



aggacgagat ccaagacagc aaaagcagac 
aagcagcaga gcttcacttc tgccacctcc 
gagattcaga aaaagagaga tagaacccac 
aagaaaggag aggatttgac agctatcttt 
tattttcccc ataaaagaaa ggactatgat 
aatctgaact taagagaatt tgactaattc 
atttctgnaa ggattcactc ttccatgggt 



<210> 256 
<211> 431 
<212> DNA 

<213> Homo sapiens 



<400> 256 
aaaaatcctg cctccngtgc tcctgagtcn 
aagggggggt ggcataccag cttaaagaac 
anngnatnng aagncccatg ctgctctgan 
cctactanac tctggtgcag ggctgcanat 
ggcgcnccgc anggggagtt ccttctcagg 
cagaatggat gaagcccccc ccccctntgg 
antnaactat taatcctatc tnaaaaacng 
acaaaaactg g 



ctnctntgcc tncaggnggg tcctggcnca 
tgtgttcnnt tgnctgcaac cctgnagtac 
ggcgtcggaa tatgngancg atccttgcct 
ccacaaagcc caagctgcag caagtccgaa 
agactgnggc tttgctctta cggccttcga 
anggtaaccc gctgcattca aaggcnaccg 
gcttccanaa acaaccacac ttgtgtttga 



<210> 257 
<211> 332 
<212> DNA 
<213> Homo 



sapiens 



<400> 257 

gagcctntnt ccctggcaaa tgggcttcac tgttcatcac agaaacctcc tgaaggaccc 

atctactctt caatcaacag ctggtgccct acgattctct gaatcccttg cctggcctca 

aaatccctca cctcatggct tccaccagtc ctggactact gtgttcctta cacaacccta 

accaagcccc cacattgaca cacccacctt aaagagnact gctaggcttc agaaaaccca 

accttgcctc cttcctccca gacaggccaa agccctctgg aatcagcgcc ctcccttcgg 
caagtgagta ataaactcag ctttgcctta cc 



<210> 258 
<211> 309 
<212> DNA 
<213> Homo 



sapiens 



<400> 258 

gtgccaatat cggtcagaga acaggatttc agtggcagag ttg-ttgctat actgttatct 
cttcagaacg gaggcacaag gagagatgaa tgccacatcg caaggagcaa aggagagaga 
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gagaaagaaa tggtgtcagg tggcatgttg gatgtgattt ttgttttagt agagattgag 180 

atgactgtaa attgtttagc tgattccttc ggtctgcaaa gatacatttg tgttggtgct 240 

gatggttctt gactaatcct gtttcaatta caaattggtt atgtttttca aataaaactt 300 

ctggcactt 309 



<210> 259 
<211> 427 
<212> DNA 
<213> Homo 



sapiens 



<400> 259 

gctttggaag gagtttaaac cttaagctta ccctttcaat catccactac cccagggaca 60 

gaaggtgggg aaaactcaaa ggcacangct tgtactgaga agttttgagc aatggagaag 120 

aaagtgggag cttctgactg accttagccc accacagtca ggctncaaga ngggagatgg 180 

cctgggntna tggctgcctt tcntctggtg nnccttacct tttgggaaaa cccccanggn 240 

nagaaaagtc ttcaagtctt gtcagactgg gaagtcccca actcccaacc tnaggaagca 300 

gcccttggaa angagaagga tgagattttc caaagctatc tcttaccact ttccttnccc 3 60 

catctcattc cntccatnta ttggggagaa gncctctnaa gttnggcctg angcttctga 420 

gggattg 427 

<210> 260 
<211> 478 
<212> DNA 
<213> Homo sapiens 



<400> 260 

acatggaaac tgaggaacag agagatcaca tatcttgcac aaggtcctac agttggagag 60 

agaatgacta tttcaacaat ggcaaatagg gttcatcatg tatgcacact ctgattgctt 120 

tgtggtggct tcctggatca ctgggttgaa aaagacccag gctctgtagg aggtggttga 180 

ttaatgatgt ctgccattca gaacaaagat gtagcagcag gtgtacctca tttttgctgt 240 

ctctggacta ttccattgaa gcctttagtt cctggattat ccaattagcc ctagctttcc 300 

tggcagtgtg atctccctct gccttaatat cagccctcag ccctcgggat tcttctctct 3 60 

gatatccaca ctcattgcct ttgcttctct gngctcccta aaacaacgac ttttcttccc 420 

caagccnaat tggaantaan tcctacctcc agngnganac tggccccggt cggcagcc 478 



<210> 261 
<211> 412 
<212> DNA 
<213> Homo 



sapiens 



<400> 261 

gaaagtagcc aaatcacctc cctggctctg gaagggtgtg gaagtggtgg agtaagagtc 60 

ccagcccaga taagggatca ccaccagaag atgaagaaga tggtatgtcc agagatccaa 120 

aggcaatgcg ggcctcacag tagatgccag cacacagtgg tgacaaacgc ttggacaaaa 180 

cccatcaatc tcatgaacag cagagaggag aaacattgag tgaggatcag cagcctccta 240 

gagcactagg ctcctgcatc agtctcctgc aacttagata ccaccttgag gtcgggggtg 3 00 

gtgacaggtt tcattgtcaa ttgatgagtt tgtttcaatc taaaaaaaat taggtggggc 360 

ccagaatgaa ctaagatgat gtttttctgt cttgganggg accgggcctt ga 412 

<210> 262 
<211> 389 
<212> DNA 
<213> Homo sapiens 



<400> 262 

gctccagacc tgtgtgtgca ngctgcctcc tggatgcccc tcggttgtct aatggacatc 60 

tcaaacctca catgtctcca cttgaaaagg atgagtttca tggaacctga gcatgcccat 12 0 

atgcccctac tcccttgtgt gcccccacac cgtgcctgct cttccttcag ttgatcaggt 180 

gaaaacctca gagtcacttt taacacctcc atttctctcc tgtgccaaca accaaattat 240 

atccaaaatc tgaccacttc tcaccacttc cacatggact gctgtgttca agccaccacc 300 

atctcttgcc tgcattagtc cagcagtctc ctanctgaca tggggactga gattcagaat 360 

atttgggatc aaaggtctta tccctgaat 389 



57 



CAl -201292.1 



<210> 263 
<211> 298 
<212> DNA 
<213> Homo sapiens 



<400> 263 

aatgttaacc acaggacgtt ccagctgtga ctcattgcaa ctactgacaa gcaagctgga 60 

gtggccctgc ttttagagag cctgaagatc tactcagagt gaacaatact tgaagttcta 120 

attgagttac agaaaggaaa ctagtaaaaa ctaagaaaga ttgcgattct caccttgaat 180 

atgcagatct aatttctata actgtgttta ggggtatttt tctaaattac taaaataatg 240 

cttacatttt caaattggcc attaaatata tcttcagatg cggagatgtg tatattac 298 



<210> 264 
<211> 470 
<212> DNA 
<213> Homo sapiens 



<400> 264 

acagagctct gcaggcacag ctgaggacgg cctctctttg ggtcccccag actcatccct 60 

gggagctcac aactggcaga gggagacaag ggcgctccaa gcagcagccg tgggggagtg 120 

gtgatctcca gcttcaactg ccgggccgtg aaaacaggaa ccagccctcc aggccaccgt 180 

ttctctgaaa ccaaagctca gcaaccgaaa aaggatcaaa aaagcagatg gtggaggtgg 240 

agcgaggcag ctgtgcttct cagtgccccg tgccgtcctc agccccatct ctggcacaag 300 

tggtccaagc agcccaggac tccatggcag gccctaccct tgcaggtgaa ctgcctcggg 360 

tctnccagcc tccacattca catatttcaa acagaaacac caccaacttn ctgggctnac 42 0 

ccnttgggaa attccccaan gaaaacaaag ggggactcat atttgggcca 470 



<210> 265 
<211> 202 
<212> DNA 
<213> Homo 



sapiens 



<400> 265 

ctgaggaaaa acctacaagt ctacttggag gaatccccag cattttcaac aggatgtcag 60 

aatgaccttg ggctatgttg gcaaagcaca atgggaagaa gacaaccaat tgaaggtcaa 120 

actaggcctt aaaaaaaatt gttcttccta aatgaaactt tatgtaagac ccaaacttcc 180 

tttatgtaaa aataggatac cc 202 

<210> 266 
<211> 258 
<212> DNA 
<213> Homo sapiens 



<400> 266 

ttttccgtct gtccagctcc accactaaat agtgtcttta ttccgaggag ctacctgatt 60 

tgggactcag tcttcctaca aggcaaaaag agaagacctg gatgctccac gtggtccaga 120 

catggagcaa gtaaaccgag ctctcgccac accgcacagt ctcctcagcc tcctgctcaa 180 

tgtgctttca ttggaaatgc ttattgtaaa tgatgacact tttttaaaac caaaattcaa 240 

ttaaattcaa tacatatt 25 8 

<210> 267 

<211> 320 

<212> DNA 

<213> Homo sapiens 

<400> 267 

gataataaaa catgaagtgg aagatcttct agaccagcac cttaaatttg cagatgagaa 60 

agttggaacc cagaaaggct gagaggctca aggtctcaca actgtttatg ctcaactggg 120 

aaatgaattt gtttctctgg cccatcaggt caacattctt tccactcagc tatgccgnct 180 

cctacctcct gaaaagattc tagcaggacc ctctgatgaa aaggacctta tctttttata 240 

tctgctgttt aaagcttttt tttaaaatca tcgcacgatt ttatgagtta agttatgtac 300 

ataaacaaat actattactt 320 
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<210> 268 
<211> 498 
<212> DNA 
<213> Homo 



sapiens 



<400> 268 

gagcatgacc agcagactaa cgcagcaagc agatgatgct cctgatgaaa agggcagacc 60 

cagttgagcc tgggctacgc tgacacagac tttgttgctc ttcatttggc aaagttcctc 120 

ccagaatccc tgcaggcata caacagatgt tcagtaaaca ctcggttgat gagaactctg 180 

ggaagacata gctgttcgac gaacaggcat cagaatttat catttgaaat tatcaactca 240 

aaaattcttt ttttcctcat acatattctg cttatgtatc aaaaattatc ataagaaacc 300 

aagatttctc agaacatgtg aggtcaaaat ggcttataat gtaaaagaag tggagtctca 360 

atctatactc agtatctccc tctcttttat tcatacacat atggacactt gcacttctaa 420 

gaaaaaatga atttttttaa actcattcat ttattaaatt gatatggatt aaaccangna 480 

atattcataa catattct 498 



<210> 269 
<211> 342 
<212> DNA 
<213> Homo 



sapiens 



<400> 269 

cntctctgga gagcttncat ctgcaccatg agcccatgcc atcttctgac tcctggagct 60 

acagtgaaga tatattttgt attaatgctt aacttcttca tttcagttgc cattgaggta 120 

gcctaataac attcataagt aaatactgga ttttagtttg caatagaaaa accttccatg 180 

taatataata tgtctataca attaataatt aattactttg ttaaaatatg tatctttaaa 240 

taaataaaca ttggtagaga ccaaaaaaaa aagaaaaaaa aanggccacn gnggccaatt 300 

cagctnggac ttaaccaggc tgaacttgnt caaaaggggg gg 342 



<210> 270 
<211> 159 
<212> DNA 

<213> Homo sapiens 



<400> 270 

ccagcattta tggatcttca gaggnttctc tctgtgataa ttcctcatca aattaccaat 60 

aagaaggata tgaaactaca gcccccacaa ggatgcctgg tgaccttcgg ccctgagatt 120 

tacagtctgc ggaagcaata aagttcctct ccctctctt 159 

<210> 271 

<211> 521 

<212> DNA 

<213> Homo sapiens 



<400> 271 

ggcaccgcaa gacaacgtat ctcccctccc ctgtgcaatc agtcaaagaa catttagtca 60 

acctgaactg ggagcacagc gctcctgggg ctgttgggca ttcaaaagag tgtggatcag 12 0 

tgttaaaagt gcctcatgga gaaatggagg cctgaaagcg actctgaagg aggagtgggg 180 

ctcagcaaac agcagacgag tttcaatcca agcacccatt acccccctaa cacacggcat 240 

acgtgcatct catctcctcc tgtgtcgcta agaagctacc catatgtctg tcattaattc 300 

tccagaatcc ttggacacac ccctctgcag agctttctaa cagaaataca agtctcagat 360 

ttttttttaa gttaaaattg agtgcagcac tcataccttt cttcgagcat gaaccgtcaa 420 

tcaacactgc ctcatgagct actgntctcc tgctctttaa aaagacaaan ccttatttct 480 

ttgtagngat cncaaagngg ngggattnac ccggaaactt t 521 

<210> 272 
<211> 460 
<212> DNA 
<213> Homo sapiens 

<400> 272 

agtttcactc tcagaggagg attttgttct tcaattgtgg agtgatctct atcaccagtg 60 
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actaaagcag atgttggagc acagagagcc ataccccaaa atatgatgct tcggcatgct 120 

gactgctttg aaaattgaaa ggcctcagaa ataatcctca gtgccagggt ctccctctga 180 

cctcccccta cctccctttc tctctgatcc tgtctctccc aaagcacaga atgagctgtt 240 

ctctgaattc ccttatctac ctagaaactg gacccccaaa gaggaacaca atttgccttt 3 00 

gatcccttcc ctgaaatttc attaaccaga gaaaattaaa acttctatca caaggaagag 3 60 

actgaacatt aaacaccata gctacagccc agacaaactt cttcccaaac cattgtttgt 420 

tctcctgcct gttaaattgc cagagaatca ttcacaagac 460 

<210> 273 
<211> 224 
<212> DNA 

<213> Homo sapiens 
<400> 273 

ttgacaggaa ggcaatcatt cattcattca gcaagcaagc aagcaagcat ccacaatgag 60 
cctggatgcc acatggacca cgatcaccaa ggagatcgat aaatcccaca atgttgttcc 120 
ctgtcttcaa aaatttgtca agaagattga gatccactgc tgtaagatta cacagatgcc 180 
ctcctcatcg tctatgacag gctataataa atcttgccag actt 224 

<210> 274 
<211> 338 
<212> DNA 

<213> Homo sapiens 
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<400> 274 

aggcgagaaa ctgtgggata agaggctgca gcaattgcat gagtagaccc tgaaggtatg bu 

aggtttgtta aaatggatgt tcagagaagg cctgacacaa gagggccact ccatttgtcc 120 

ccacggacct gggccggatc tctcaatttc acactgatgg agcctgaaaa tcaacaaaca 180 

agacggcaag aacagggaag acattgttct ctccaaagtg gacaatttgt gacaggccca 240 

ggaaggctgc ctgggcttta tagcttttcc agtggttcct aataaaccag gctttgtgtg 300 

agcctcgttc aagccatgcg gggccctgtc gtttcttt 33 8 



<210> 275 
<211> 158 
<212> DNA 
<213> Homo sapiens 

<400> 275 

tcccaggtgt atccaccagc tccgaagaga cagcgaccan gcaagaacgg gccataacga 60 

cgatggcagc tttgtcaaaa agggggatat gtagggaaaa gagagatccg actgttactg 120 

tgtctacata gaaaaggaag acataagaaa ctcctttt 15 8 

<210> 276 
<211> 144 
<212> DNA 
<213> Homo sapiens 

<400> 276 

acttcagttg acccaggcaa ctgaaaccga ggaagcaaaa ccatggaccg tggaaagaag 60 

catcatatag gactactgta ttatgtatta taggtggctg tggtatcaac atacttagtt 12 0 

gataataaaa atgtttgcaa agtc 144 

<210> 277 
<211> 561 
<212> DNA 
<213> Homo sapiens 

<400> 277 

gagcccatca tggcgacgcc ccctaagcgg cgggcggngg aggccacggg gganaaangg 60 

ctnggctnca aaactttant antgancngn ctgcacggga ctggcgaaan ggggctgaac 120 

catcgaaaca gggtattatg aagccagctg ggccaaatac cttcaactgg agaaaatggt 180 

catttgagcc gaacttncag ggaaagctaa agcactcggg aagttattat atgccaggtg 240 
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ggattttggg cctggtaaac tttcttcgtt 
acgccatcta tgtggcccct ggggaaaaat 
aagaaagcct tctaaagttt catttgattc 
aagcaaacag ccctcaccca agggactcca 
gctagagggg cttaaaaaac tacaaagggc 
ggcttttttt ccccctactc a 

<210> 278 
<211> 338 
<212> DNA 
<213> Homo sapiens 

<400> 278 

tgtaagctcc accagagcag cagactctca taaaacctca tgggatgaat gaaaggagtg 60 

tcatccctta agacattggc aacaaaagca tagcctgaca tattctacta caagtgcctg 120 

cagtaaccta tgcagagagg agcaaatgaa ctcccacagg aaggtggacg atccctgagc 180 

cagagataac tggaactctg gcagtttgag tggacactca gtcacacact cacacactca 240 

ctcacagcgt tatgcaattc caaaaattat gtgtttggtt ccaggaagat acatttttcc 3 00 

cctctaagtc caaaataaag atagaaatgc atatatct 338 

<210> 279 

<211> 271 

<212> DNA 

<213> Homo sapiens 

<400> 279 

gttcccagta gctgcagcag tgaaagacag tgattggctc cagtgctccc agaaggattt 60 

gggctgaagc caggggaaca gaaccagaag aggattccct ttccagagac catcaggctc 120 

ctcatgtctt gtctctcctc tctcccctcg tggtggctca ggatttcagt atggctgagc 180 

agcccatagg taggcctcaa cacttggtgt caccacttca gtctctatat gtttggccct 240 

tgtgtaaaat aaacaaaaac ttgggcaacc c 271 

<210> 280 
<211> 490 
<212> DNA 
<213> Homo sapiens 

<400> 280 

gagctggtca gctctgacct ggagtgtgtc taccctgacc gtgtgacacc gggatcaaga 60 

ccctctcctg gggtcttgag gacgccacat gtgggcgttg ctctaaagag cgcttgctcc 120 

taagcctcct gcacatggaa ccccaccatg gaatctgctt cccaggaact caccctggga 180 

ccagcccctc tgagactcaa gtcaacattt ggtcctaggg ctgcaaagag gaggtgctaa 240 

gaggccaaag gctacttcca cctggagaac gggccccgcg tgccagctcc cccaaggcct 300 

ggccaggatg ctctgctcgg aggcctgtcc tgacttcctc tgctcattgc acctgaaatt 360 

acctaaccaa cacctttctt cctcccaccc ttccacaaat acttattgag catctgctag 420 

gtgccaagtt actggtctgc acaatgggca ttacnggccc tgaaagaaat taaacnggaa 480 

ccttcttgtt 490 

<210> 281 
<211> 512 
<212> DNA 
<213> Homo sapiens 

<400> 281 

gaggtgattc atcccaccag tgcttcttct gcagacagta aaatatggtc ccagtgacca 60 

tctcaggtgc catgcttcca gcattttatc agacaaggct gaagacagca gacattaaac 120 

ttcagttgtg tgctccacag aacattagct gtcttcatca ttactttgca tctttcagtg 180 

ataggctgct tgacatgtta ggaacctgaa aatgatccca tcttgaccga atctcaaatg 240 

cccttcctga gcagcactga tgaaacagat ggagcacctg gatgttatct gctttggatc 300 

tggttctcag gaggaggagg agcagaaggc tgggcacaac ggtgtttgag gttctcaact 360 

gcccccagaa agaagggttg acttgattta cattgacttc aacttgatta tcttgatcta 420 

cttaactggc ttttcggctt ttatgcttca agccnccggc angantggct tccttntggt 480 



tggacacagt gggtccccaa 
ggtttttttc ctggaggttg 
gtaaagaact ctctcctcac 
tgaaatatca aaagcccata 
tggagaaatt tncaaaaaaa 



gatacctttc 300 

acacc tgggc 3 60 

aagaagcttc 420 

tccacatgtt 480 

actcaacatt 540 
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caacttgcan gncttttgac ttgggattta ac 



512 



<210> 282 
<211> 393 
<212> DNA 
<213> Homo sapiens 

<400> 282 

gctgtaagct ccttggaggc aaggattctg tctgcttcac ctctaaagct tcagcatggc 60 

atgtgccctg caaatggcag tgccagtgga catatgctag atgagtggat gaaggaccat 120 

cccacatcag ctcatcgtgg agtatgcagc tcagtcctcc cgcctctcag ggacaacttg 180 

gatcttcacc gttcttcgcc actaagaatt cnagtcatct acattcagag ggaagctgag 240 

caanctggct cctgcccaca ctggaaaatt tctctgccta aaccagcttc cctaagccga 3 00 

ggggagagtc caagatcccg aagatggcag ggccgtgcag gctcctggga ttaagacaca 360 

aacaagccct gttctcaggc tgacagtaaa tgg 393 

<210> 283 
<211> 139 
<212> DNA 
<213> Homo sapiens 

<400> 283 

ttactcatgt cagtaagcgt ttactgagta tctcctgcat cctgggcact tctccactcc 60 

aatgtgacag cagtgaatca aacgacagct agccctgccc gcaggcactt gcattccaga 120 

gagaggagac aaagaatac 13 9 

<210> 284 
<211> 482 
<212> DNA 
<213> Homo sapiens 

<400> 284 

gtccttgatc tctgtggctg tgagacgatg aatctagggt gtcaccccag acaacgaggc 60 

tgcttcaaaa tcccaaagtc caaaggagga ctgcttcata agggaaggat tgtttatagg 120 

ttggtatact gtgcaaaatt aagtatagga ccaaaaacag ccaagacatt tgaaagttgg 180 

aaagttgatg gtaatggttt cctgggattg gaaggcagac ctcctccgct gatgagcaaa 240 

taatgaggct gtgctatgat caaggcattg tgacccctgt gacccacacg tacacatcca 3 00 

gaaggtctcc tggagccaga aagtctggga caacaggaaa accacaaaag aagaaaaaca 3 60 

gctcctgtct tagctgatta gccaaccttg cgaccttcta ccattggaac atgctctacc 420 

cttacttant aatncacttt cnggaccntg ggctntgtga cccctccccc ttgggataat 480 

aa 482 

<210> 285 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<400> 285 

cctccatgct ctgaggaacc ccaagcagct catggagagg cccacatgga ggggaagagg 60 

agctcccagc cagcattcaa cttgtcagta acggaagtga accatcttga aaggggatct 120 

tccagtctcc aatcaagccc ccagcccaca ctgcttggaa cagagaagcc gtccatgctg 180 

agccctattc aaattataga ttaatgagcc aaataaatga ttgttgctgt tttaagccac 240 

t 241 

<210> 286 
<211> 222 
<212> DNA 
<213> Homo sapiens 

<400> 286 

gaagtgggaa tgatgcatat tcaacgacgc ctacaaaaat tacttcagat tgttagtctc 60 
agaaacccac tggtggcctg aggggacatg caaaaagaag aggaacagga gcagagatgg 12 0 
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caaattatta aggtttcaag accttaaaag agacaatcaa agtattcaga ttctcagtaa 
aattaccaga ttaaatcaaa taaaacccca ccctttttcc ac 



180 
222 



<210> 287 
<211> 280 
<212> DNA 
<213> Homo sapiens 



<400> 287 

attaaatcaa gattatgtct gacaaccctc tcaaaatgat aaaaactaat ctgcagagaa 60 

aactggctgc agaggaaccg gctgcagagg aaccagctgc ttcctcctcg gaacatgaag 120 

aggtgaacag agagatgaag cctntttntc ctccctcacg tttntgaang atcaaaatca 180 

agggcancng ggagaaagaa taacaaaacc aacaaactgg aggtcaagga gagntttttt 240 

ctttttttta cctttctgcc ttttccattt ttaataaaca 2 80 



<210> 288 
<211> 435 
<212> DNA 
<213> Homo 



sapiens 



<400> 288 

ggcttatctc cttgttgtat ccagaatcat atgacaagca agagtcctag aatattttat 60 

ctacctaatc atcccactgc cttattccag aaagaatcta aggaggaatt tttatttctt 120 

cagtcaaaag atgcaggaaa gacatcctac ctcttggaag aatcattgga ctggacatcc 180 

aaacacctga gtcctagcct tgagtctgcc tctcacagca gtatgaccct gggcaagtcc 240 

ttgtggaata agggcatgga cagaatgatg tcagaggttc cttctagctc taatattcta 3 00 

cagtttcctt ttagttcaaa tttaaagaca aaatgtctaa cagtggttct tgtttgttat 3 60 

gaccagtgtt gncaaaagag aagttgtaca aagttttttt tgcctgnttt tcatgnatgg 420 

gggagggggg gggat 435 



<210> 289 
<211> 166 
<212> DNA 
<213> Homo sapiens 



<400> 289 

caaacaggaa caaaggaaca aagtgagagt ggagactgct gagtcatacc taggagaaga 6 0 

ctgcaactca cccagggagt gagtcttcac cctaactcac cggggaactg gaccgaccca 120 

gacaatttgt taagttctgt ttccattaaa cataattctg agtctg 16 6 



<210> 290 
<211> 507 
<212> DNA 
<213> Homo sapiens 



<400> 290 

gaatttatgt tgatgcagtt aactccttgg gccaaacagc actttttgtt gcggcgttat 60 

tgggccttag gaaattcgtt gatgttctgg tggattatgg atcagatcca aatcaaatgg 120 

gagccctgtg ttgcacgctg agccccgctc catggaatgc aggagcattg ccatggacat 180 

caattgtact catctccctc cccagccgct gctttgatgg gagcacccct gtccatgcag 240 

cagcattttc gggcaatcag tggatcctta gcaaactgct ggatgcagga ggtgacctgc 3 00 

gactccacga tgagaggggt caaaacccga agacttgggc tttgacagca ggaaaggagc 3 60 

gtagcaccca gatagtggag ttcatgcagc gctgtgcctc acacatgcag gccatcattc 420 

aaggcttntn tttccaactt cttgaaaaaa aaaaactccc cgcaggggtt tgtttacagc 480 

ccgtcctggn ggggttgggt tcttttt 507 

<210> 291 

<211> 192 

<212> DNA 

<213> Homo sapiens 



<400> 291 
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tgaatcgaac gccacactca ggtgagntga gaaaccctta ccgcgcgcac tgcaatgccc 60 

tccccttcac tctgcaccct ccacccccct gaaattctgc ccttaggcta cggggcgtcg 12 0 

tcctttcgca ccctccccca tgctgccaag ttgtagctat agctacaaat aaaaaaaaac 180 

cttgttttcc ag 192 

<210> 292 
<211> 408 
<212> DNA 
<213> Homo sapiens 



<400> 292 

gtggtagaag tctgtcttct ccccgtgctc ctaggaagac cttcatgtcc tccttgacca 60 

acaggggatg gtggaagtga ctctgtgtga cttgtgagac aagattctaa aagtcatgca 120 

cttctgcctt gttctcttgg gataactgct cttggaaccc agccattgca gtgaggaagt 180 

caaatagctc catggacatg ccatgtgtag gtgttctggc aaacagcccc aggtgaggtt 240 

ccaactgaca gccaacgtca accacctgac gagaatgagt cttccagcct tgatctgctg 300 

agtcatcgcc aactccagcc aatactgtaa ggagcaaaga tgagctgttc tgccaattgt 360 

agcccaaatt gcagattttt gaataaaata aatgactgtt atcattgt 408 

<210> 293 
<211> 316 
<212> DNA 

<213> Homo sapiens 



<400> 293 

aagtctcagc catgaaccta gcagtgaagg aggaaaacat 
acacaaagat gaacataaag aagaaacaca gactctggcc 
tgggggagac tggataaata atttaaaaca tttatttaac 
gataagtaca atagctaaag tgtgggcaaa gtgtcgcagg 
aacfctccaaa aaaatcttac atacttaacc ttttcccgac 
cagcataaca actcac 

<210> 294 
<211> 149 
<212> DNA 
<213> Homo sapiens 

<400> 294 

gctggtagca gaatggctgt tgttattcca agaggccctc ccggactata tcccagtgtg 60 

tatagtccag tgaaacgacg ggaaaactat gaccatgaag caaatctgga gcaccacctg 120 

attttttaag gtagatttta ccgaaacac 149 

<210> 295 

<211> 233 

<212> DNA 

<213> Homo sapiens 



cttatgtctt gttctctaca 60 

tggagaagtt cagtgtctgg 120 

acataattac agtgcaatat 180 

aacaggaata aagaggagac 240 

attttgacct gaaaataaat 3 00 



<400> 295 

gaaaagtgta ctggctcctg tcctggatca actcagaaaa tgaaacacat cggattctgt 60 

ccaggccggg cacagcaacc tggcccatnc atgtggagcc tgcagtgaca acttccgcta 120 

tctgcacaaa actggaggga ggctgggggt gctccaagta taagtttcct catcancaaa 180 

ccggaaagag aaagaccgac ctggaggctg gttatgggga taaaataaat atg 233 

<210> 296 
<211> 143 
<212> DNA 

<213> Homo sapiens 
<400> 296 

tgtacagagg aagaaccatt gtgaggataa agcaagaaga caaccgtctg caagccagga 60 
agggaaactt tatcagaaag caactgtgct ggaaccctga tcttagattt tgtagtcttt 120 
agaaaagaaa taaatattat ttt 143 
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<210> 297 
<211> 201 
<212> DNA 
<213> Homo sapiens 

<400> 297 

gtgatactgt ggctgacagt atttactgtt aaatggagtg gaagtgagaa aacaccacag 60 

aagggggcac ctanattcga accggggacc tcttgatctg cagtcaaatg ctctatccct 120 

gagccctacc ccctctacct gtaataagct tcttccgtgt ccacctacgg tgactcaata 180 

caatcaagtt ccacccacac g 201 

<210> 298 

<211> 77 

<212> DNA 

<213> Homo sapiens 

<400> 298 

gctctgatga ttcttaagca aagagatgga agatggaatt tcaaccccat ggagatctaa 60 
taaacttacc cagagtt 77 

<210> 299 
<211> 452 
<212> DNA 

<213> Homo sapiens 
<400> 299 

atgaaaaaac tgaggctggn aganggccnt gcccctgccc anantcatgn atntgnccta 60 

ngatggatgn ggaatnctgc cccaccantg gnggcnttat tattacaccc atattacana 120 

tntagaanac tgaggctcan cntgggtncc ttgccatgan cacacannna gangatanga 180 

gaggctggct ctgcctccta tgccnctcct gatccactct ccaaaccctc ctccagtccc 240 

ctgctccaag ccatcagtta ggatgattct tataagccgg gggtgtgaca tgccaaaggt 300 

gtctctaccc cacatactcc ctctggaanc aggacaaggt ttgcgtgagg tggacctggg 360 

ttctttctgg accagggact ttgcctccaa gctcatttcc tcatctgtaa aacaggaatc 42 0 

caaccaacgt cagcctgaat gggctgtggc tc 452 

<210> 300 
<211> 434 
<212> DNA 
<213> Homo sapiens 

<400> 300 

ttcctcatca gaaggaagta cttcatcaat tacgtcctct tcatattcat caatttcttc 60 

cccatcatac tcatcttcgc ttcccacatc acttcctgac acctctgcct catcctccag 12 0 

atcgcttcag gagttcttct tcatcatcat cttcatggtc ttccagtgcc agatcattac 180 

cagagtcact gtgttcatcc tacaaaatca gcatcatatc caaattaagc agaataaaat 240 

gcgtcctcaa tgaaaaaagg atttataaac atctgcccaa atacctcatt ctaggaaatt 3 00 

gtttctgata agatgccaaa cttagaattc tcaagaactg aggggaaaaa aacacttgag 360 

ggcagcaata catggagctc aantatgaat acctttggtc ccttctacct cccctnatcc 420 

ttttcaaact catt 434 

<210> 301 
<211> 456 
<212> DNA 
<213> Homo sapiens 

<400> 301 

ctctcaatct ggggcatgac tttgaaggga aggttgctaa gcctcctaaa tcccgataca 6 0 

caccaatgcc gtttccccac tcatctggaa acctggtggg ncctgcccaa acgcctgtat 12 0 

gccaattcca cctggctgct tggcgaccca ccatgcccac attttccact caagcctttc 180 

anatctgctt tgggcacctg aagacagaga gaatcatctt tcaagagtca gaaactttgc 240 

acgtgccatt ccctctgctt agaatgcttt tccctttctc ccaattgcct tatcatcagc 300 

ctgggaaaat atttatttcg gctcctaaaa tctcagatat cacttctcca ggagctttcc 3 60 
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cagatgcctc acttgattcc agaaggagct atcgccactt ttgcctggcg agtaccgttt 420 
tcaccgttac acttatacgc tatggcaatt tattgg 456 

<210> 302 
<211> 187 
<212> DNA 
<213> Homo sapiens 

<400> 302 

tgactatatg acgtgtgatg gcccaagact gagtcaagaa gcagatgcaa gaatctagct 60 

gactttcagg aaattagacc ttaaagcgac ttgcaaaaat gaaaaacgaa gcctcttcca 120 

aattttttgt tttggaaaat tagttatttt tcataaaaaa cttacattaa agtatttatg 180 

tcaaggt 187 

<210> 303 
<211> 449 
<212> DNA 
<213> Homo sapiens 

<400> 303 

tttcaggttt taggatgacc agtgagatgg tcagaacttc agaaccttcc aaggtgatgg 60 

gtcattcaag ctccaggaac gtcaaggcct caacagtttg gacataattt taagcaacac 120 

atataagacc cacaggtctc cactgatatg actggggatc tcatgaagaa actactcgac 180 

aaagacagat actggaggga tagaagagtc tatgaagtac agaaaagagg aaagatctgc 240 

aaacaattcg gtgtcttctt ttaacttgaa actcattcta cccactgcta cagctaggta 300 

ctgtgctctt gctcagattg ctggagggtt ttgttgngat gatctccttc aatacatcaa 360 

tactataagt tctataanaa tcatctcaga gcttgtttan aactcatttt ttttcttttt 420 

ctgggntatg cccttataat attcattta 449 

<210> 304 
<211> 309 
<212> DNA 
<213> Homo sapiens 

<400> 304 

gtggggtctt tcaccggcca tgtccctggc tgactgtttt cctgctgatc ctgaccagcg 60 

tccccggcag ccatggcctg cattcgtgtt ggtccctcct cctgcagccc cgaggaggca 120 

gggctgtctg tggatcccag atcggttgtc ggaaggcccg gaagaggaga gctgccctcc 180 

accaccactg tctcctcctc ctggacaaca gagtcagaac actgctgaga tggggtgaag 240 

cataattgtt gcactgagac tcaaaactac aggcaagaag gtttgaaaat acagaaacat 300 

ttcacgaat 3Q9 

<210> 305 
<211> 174 
<212> DNA 
<213> Homo sapiens 

<400> 305 

gatgatgctg cccttaatgc tcagctgatt acagactaaa cacaaaagtt cccagaggaa 60 

aatggtggac ttgggagctg ctgcctcagg aggatcttga gtgttagtgg ttcctcccta 120 

tcagatgtac ctaatgccca ggatttaata aaggatcatt cccattccac cacc 174 

<210> 306 
<211> 464 
<212> DNA 
<213> Homo sapiens 

<400> 306 

gagccccttt cctggacaca ctcgtgtctt tcccagggaa tgggaagaaa caaaaggatg 60 

atgacatgac acctaataag tctggatctg gaagtaagtt tgatctacgg ttcattaggc 120 

tggagcagaa aaaaaagaaa gggtccggta tgttcgcctg tgtgccaggt atggtgttac 180 

gccactcatg tgccttatat tccctacaac ccctcacccc aatttatcac ttcaaaaatg 240 
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ataaaagctg agacttggag aaactagtaa ctaaccaaaa gtcacccaag aaggaggtgg 300 

caagctaaga tcaagcccca ctttggtggg agctaagagt agcccttggt agagtcatgg 360 

ggttggctaa ttcttgcctt tggaacctgt ttctatctcc attcagttcc tttctttcct 42 0 

gtcagttgga ctgtaaactc taagatcacg aaatttccct ttat 464 

<210> 307 
<211> 481 
<212> DNA 
<213> Homo sapiens 

<400> 307 

agctttgcta gccacgtgtg gttcctagac catcagcatc aacattacct ggaaagggcc 60 

tcttacagat gcagaatctc tgccccaacc cagacctatg gagttaaaac ctacgggatt 12 0 

tctagatgtg cgggagtgaa ggagctggtg gctatcagac ctcaaggtct ccaacaggac 180 

aagatcaaga gggattccac tcccacagac cactcactca ccctaggaag actgtgaaat 240 

gcctgtcctg gtgcttagtt tgaattgttg aaagaccatc tttacggcag aaatgctttg 3 00 

tcatttcact tgataagggc cttgggtttc aagccagttt actcttttct gtgagcattg 3 60 

aaagccccct ttttnatttg ctccgaggca ggattttgac ttcaaagcca aaataagaat 420 

ttaggaagaa aagaaaggga gggaggaaaa agggaagttt ggtccaggaa aatgaaaatg 480 

c 481 

<210> 308 
<211> 177 
<212> DNA 
<213> Homo sapiens 

<400> 308 

gggcaaaccc atgctttatg aagcctgatg cttacacaat tatgggagcc ttctttgaaa 60 

aaaaaatttc aaaattacaa atgcaaaatt aggtacaaaa gggaatattt acaatgagaa 12 0 

atcaccacaa atggcaagat ttaaacagct gacaaattaa acagcgcaaa atccagg 177 

<210> 309 
<211> 366 
<212> DNA 
<213> Homo sapiens 

<400> 309 

gttgcaagaa agctcaagta gcctatggag aggatgcaag gcttccagct gatgccctca 60 

gccaggctca gtagcagcca gaactagcct accaacgaac ctgctgatca tgtgcataag 120 

ccaccttgaa cgtcgatcct cctgcctggt ggagccatcc cagctgatgc cacatgaagc 180 

agacacaagc tgtccctact aagctctgct caagttggat attcatgagt gaaataaatg 240 

actgttacta agtaattaat tttcgggtgg ctgttatgta gcagtagata attggaacaa 3 00 

agcttattga cataatacat ctatatcaca tcctccaatc cattttttta agtaataaaa 3 60 

gtggtg 366 

<210> 310 
<211> 292 
<212> DNA 
<213> Homo sapiens 

<400> 310 

gacccaaatg tgaataatgc caacagcttg ctgtcagccc tgaagtttcc tcagatgtct 60 

cataaacact ggaatcactt cacacgtttc tgaaatgtga ccacctctca ggaggagttg 120 

acaacactga gtaaccggaa gggaggaaca cttatcccac tgaaactggg ataaaggttg 180 

ccatgaatgc aagaggtgcc taaatctctt ggcatgggga cttaatgggg ccttatccct 240 

cctgctatat ggtagcaaaa taagaaaata aaaaccaaag taatatgcgt tc 292 

<210> 311 

<211> 195 

<212> DNA 

<213> Homo sapiens 
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<400> 311 



atgaaagaaa gagaagtccc taagtagaaa 
gtaaggacat catctgtccc tggactctta 
gtcatcccag ccttcaaaag agccaccgtg 
tgtctttcgt aaccc 



ctgcaagggc 
agcggaagat 

ggggggaaat 



caagcagaac attataccat 
catgcaaata gtggactgaa 
aacagaaagg gataaaaagc 



60 
120 
180 
195 



<210> 312 
<211> 475 
<212> DNA 
<213> Homo sapiens 

<400> 312 

aacagttctg gaggccggaa gtctgaaagc agtatcagta ggtggaaatc agggtgccca 60 

gcttctggtg gctgtgacat tccttggctt aagctgagat ggcagtcaat gagagtgact 120 

gcagaaagtt tcagaaggac acatgggaat catttaacca ggccaataaa atcagctatt 180 

tatacacttc ccccggaaga catagccctt gcttcactgt ctgaaggaga gaaaatgcaa 240 

aagtgataaa ggcatgaaaa agtcatattc ctgagctaca agagagaaac tgaggacagt 300 

ggagatgaga ataaaatccc taaagcttaa aggatgctgg atctggattc tactggatgg 360 

nggngcttna aaagnggact gncctatcct tttcacatat gttagaggtc acacacaggg 420 

agcccacaga ccccagcatg ccaataaacg tgtttcttgt gacccataat aaatg 475 

<210> 313 
<211> 425 
<212> DNA 
<213> Homo sapiens 

<400> 313 

gtctactctg tgaaaaggaa atgatcatac ataattcacg catttgctgt acggatttaa 60 

ttaaatgata gacaaaaagt agagtggcac aagtcaaact caaaaaatag taacaacaaa 12 0 

atcaatttca aaataagcaa cagcaataaa tgttacctac tattttacga atgaaaatac 180 

tgagaccaat aaaggcatta tagtatacat agccttggaa tcagaagacc aagaacatac 240 

aagagaacat agccttggaa tcagaagacc aagaaacaat ttaactctgc ccctctagag 300 

ctctgagaac ttgggcaagc cttttaccct ctgtgagttt cagtttcctc atttatttaa 360 

ttggaatnat aattccngat cacctgaatg taatgaaaat taaacatcct tatgtaggtg 42 0 

aaacc 425 

<210> 314 
<211> 478 
<212> DNA 
<213> Homo sapiens 

<400> 314 

gtagaagatt ctgaggccct gggcgggaac tagtaagcca caatctggaa gagtctttta 60 

ccaccatgtg gaggagaact agagcactca tgttgaacta ttaccganaa aagtaatcaa 120 

cttcttttgt gttaagncac tgaaaggcaa gtgttgattt gttgcagnaa tnggggctcc 180 

cttaacacac ctgtcagccg ggccaaactc tatcacagca taaataatct tttccttaaa 240 

taaatacagg taaaagaata aagtagacct aaatgcatta atatgaatat aggctcaaag 300 

caaaatgttg ggctataaat gttcagagtg ataatttttt aagttgatgn gtaatttagn 360 

nccagtaaat tagaataaaa cctaaatgtc agttcaaatg gaatttttta catgttcatg 420 

cccctgtata atcacctccc anaaccaaca tagaaaatac ttcatataaa atgttggg 478 

<210> 315 
<211> 325 
<212> DNA 
<213> Homo sapiens 

<400> 315 

tggcaagaca ctggcctgat agaccaggag ctactccacc accagcagct acaaggcctt 60 

ggcagaatgg aataacagca aacaacattg gaggaggact tgtctgggag agcagccatt 120 

ttaaagaaga gcacattaag tcacaaacag tcgcagctga tctactttgc agcatcgcca 180 

tacatgccta actaaatatt gaaatcccgg gaaaaactca ctgtgcatca tgttccagaa 240 

actagctttg caaacagtct tttcagatgt gtacattttg tgtatttgag gcatataata 3 00 
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tatatatttt cctccatgtt caccc 325 

<210> 316 
<211> 275 
<212> DNA 
<213> Homo sapiens 

<400> 316 

acgccatctc caaatacggt cacattgggg gttagtactt caacatatga atctgaagga 60 

gagacacaat tcagtcctta acacagtgct ttatggattg tatctgcatc ttccatctta 120 

tcaccaccca aatccagcac ctgaattggt gagtgttgcc agtgagaggc caagagccag 180 

aagagcctgc ttctgcttgc agaggatgca cagttgtaat agttcgtttt catgctgctg 240 

ataaagacat acccaggact gggtaattta caatc 275 

<210> 317 
<211> 352 
<212> DNA 
<213> Homo sapiens 

<400> 317 

gttcgtgaat gactgtggtn tcanantgac tgccaatgnc gactcctgat accataaaag 60 

gaaagactcc tgtctgaagg atgtgccttt atcccagaca ctgacaaaca cctttgccaa 12 0 

gagagttcag aaacgactgc aaaccccaac ccaagcaact ggactctgga aaacagctca 180 

tgaaatctca gcatctgcct tgtctggtga gcctcgtagg gcactcacct ctattacgga 240 

ggcttgatgg cagcggcttg gtttgaactc tgtattactt atctattgct gcataagcga 300 

attaccccaa agcttagccc gcttaaaaca acacgcattt attatattca ac 352 

<210> 318 
<211> 243 
<212> DNA 
<213> Homo sapiens 

<400> 318 

tcacaacatg ggggtttggg ttggttttgg gatgggcaca cttntgcccc tgggacaatg 60 

ggaatggtgg ntttacccag gcnttngggg anaanangtg ggnaattcna ccccctngga 120 

tgctnacaaa ccntggcaaa tcttancatt ttcccctnat tgaaaccggn tgccccttnc 180 

cttantaact gcccttggac ttacctcacc attttgtgtg gccttaaatn aagaatttgg 240 

ggg 243 

<210> 319 
<211> 476 
<212> DNA 
<213> Homo sapiens 

<400> 319 

actcagagaa gaatggaggc agaggctgga gaggaggctg aggatgctgg acaaccctgt 60 

tgagaaggaa aagccggcac acaccgcgga ctgagctctg cctgcctcac cgacttcaaa 120 

gatagcaagc gaccactttt ctaggggaaa aaaactaaca ctcaagttgt gctgatttac 180 

taaacaggac gctctctatt tgtgcttcca tttgctaggg gatttacatg tgaaacctcc 240 

cccagtgcta atgggagtta ttatcctgct caatcccctc cgcacagagg acaggatgac 300 

cgcaagtggg ataggacgct tgggctattt aataaaagaa ctcttggaat taacacttct 360 

tcanggctca cagacccatg tagcctagta tatttccaca tttccttgtc attttgaaat 420 

ggttcaagtc ttgagacatt tgaagngttt tcttctaagc ttaccgaggg caatgg 476 

<210> 320 

<211> 66 

<212> DNA 

<213> Homo sapiens 

<400> 320 

aggaatcaaa agaaggagga agaatagaat gatttggagg aaaagaagga gaaagtagag 60 
gagttg 66 



69 



CAl -201292.1 



<210> 321 
<211> 226 
<212> DNA 

<213> Homo sapiens 
<400> 321 

ggtggcccgg cctccctgtt ccatcttctg agaggagcta tacccatttt gcaccctgaa 6 0 

cctccaaact cagaagtctc tgaggagccc tgaataggag aaaatgtggc tgaaaatgaa 120 
gtggaaaatc agtgtgataa ccaaatcaag atcacgcctc gctgggaccc tgtcacacta 180 
aagcttccag agcatagtcg tttttaaaat ctgtaatagt acctgg 22 6 

<210> 322 
<211> 465 
<212> DNA 

<213> Homo sapiens 
<400> 322 

gaagccaagt gggaagatcc ttgctggttt ctccctctga ggaagaagga aaatgccatg 6 0 

actcccacta tggcctctct tggaaccata ttttgaggta ccctacttcc ttcttgagtg 120 

tcagcagagc aactgtggga ctggcatgag atttggtcat ttctaggaga gcgaatgcct 180 

tttgcctctt tgatgagaaa actagacgag acattgttta gaaattcttg agctcagact 240 

ttngcattat gacaacgtgc attcaaatct gccccagcca cttgcgagct gggacctaaa 3 00 

gccgtgagct tctggttgtt tatctataac aagcggatcc cagtacctac ctcataaggc 360 

tgntgngagg gattaaaata aaatgcatct atcagccagc ttgcaggtct gcacttaaca 420 

ggggctcang tgcaatacct tgataagttt tgatagtttg ggata 465 

<210> 323 
<211> 303 
<212> DNA 

<213> Homo sapiens 

<400> 323 
cnaacctgnt angtntcatc tnatncaant 
agctgacact ttctcaactt cacctggatg 
cctgcagctg ccatctccct gccaatttaa 
tgagagctgg aaggaagcca ggtcctcgct 
ctcagcctcc cacctctaga tttcctgtga 
get 

<210> 324 
<211> 458 
<212> DNA 
<213> Homo sapiens 

<400> 324 

aatcaagaaa acaattcaat aagaatccat tttccttggt 
actggttatt taaccaaagc ttcatctgaa atggcatatt 
gctttgagga atttaagtgg accttataaa gttgataaag 
ctagtacctc atntacttgg ttcccttagg agectaggan 
tcaagaagag agaaattcac tcattttatg cacatntnac 
tcntnggntg gggttccccg ntttnaaagn gtttttaaaa 
taaacatttc ncccnaagnn cacctttaaa accctttttg 
ntttgegena aaatccgggn cccngggaaa aactaaaa 

<210> 325 
<211> 212 
<212> DNA 

<213> Homo sapiens 
<400> 325 

gagnnactgc tcaaacaaga acacaaaaat ntntnangat cctacnacag ngggttggnc 60 
ncagtgcacg ctntgtatac ctatcagaca aaagaaaatg tcaagcaagt anaacagaga 120 



gtggcaaccn ntnecttgne 
gacactgaag tccaggatgg 
ggatgaagee aatgeccagg 
gacattgttg acacactgca 
cttgggaaaa taaatttctg 



cannngctgg 60 

gatgetgeta 120 

atggcagagc 180 

tcagccatct 240 

tatttgtaaa 300 



aacaggacac aattgaaaac 60 

ttaegggata tgacgagact 120 

agccccttag aaagactggc 180 

cctnaanatn ttnggggacc 240 

nggcatagtc tangggggaa 3 00 

ccaanttnng gggtnntttn 360 

aacnettttt ttttttttgt 420 
458 
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cttagctgtg acagctaaaa natttataaa gtcatgcttc ccatcnaacc tatctggact 
tatcaacagn atgcntccag cagttattcc cc 



180 
212 



<210> 326 
<211> 483 
<212> DNA 
<213> Homo sapie 



<400> 326 

gtgtaggtct tgcctttcca gggataagtg gccacatagt tcgccgtgtt ccccgcagtt 60 

attccagtac atgttttata cttttggtat gtttgttgat cacggtgatg gtgattgctc 120 

tcaacacaat gtctacttct cctcgacggt caaggaggga aatagacaga gcccagaggt 180 

ggccagccat ggttcctcaa gacctgccaa gaagagtgca ggccaccaga gtctttgcag 240 

gtataattga ttaaagatct caagatgaag tcatcctaga tttaaatcat ccacatggag 300 

ctgccttcaa aggcacagct gcaggcgagg gtacatttct aaatcccang actagtggcc 360 

ttgttagaaa anaanaaccc ggggngaccc ccngagaaag gagatgtgaa gatggaggca 420 

gagactggag tgatacagct ccaagccaag gatcaccagc catttcaaga agctaggcaa 480 

gaa 433 

<210> 327 
<211> 272 
<212> DNA 
<213> Homo sapiens 



<400> 327 

agatgcagtt ttgccatgtt gcccaaactg gtctcgaact cctgagctca aagcaatttg 60 

cccgccttgg cctcccaagc tggaatgaca gacgtgagcc actgcacccc gccaacattg 12 0 

gcattctctg ctgccttctc tggactgagg aacttcactc aacaactggg ctcacagccc 180 

tttttccaca gagattttgt ggaatagcct ttttgtctca tgcctgcttt tcatttattt 240 

gcttgtttga gataaattaa aagcagaaaa tg 272 

<210> 328 
<211> 450 
<212> DNA 
<213> Homo sapiens 



<400> 328 

ntatgacaac aaaaccaccn tggggcccaa acctggaagc cgncngctat ggaggaccct 60 

ggaagcangc anagaaaggt ttggagtttt cantgcgatg acaccagcgt gcctgcggaa 120 

gnggntgtgt ntactnttgc ctccttnccc acccaattcc gtcccaggag cccagggatg 180 

gaggcccaag anacggatnc cacaggagcc agcacccact ccaccccagg agctcagcaa 240 

acatccacag agtgaacatt ccaagcaaca tagtccagga gccacgttcc agccatgggg 3 00 

cctctgcact gctgtcctct tcacatggcc tgcccttccc ccagaaagag agaagaggcc 3 60 

ctctctggtt gtcccatcaa aactccaccc ttctctcacc ctcctcccag ctgtatccct 420 

tctctgcagc cctaacatgc attccacttt 450 

<210> 329 
<211> 479 
<212> DNA 
<213> Homo sapiens 



<400> 329 

ggtgtgggca cacacactct ctgaacagca gaacttctgt ctgagagtag aagctgaaga 60 

gcagaagaga cactatggga atcaggaaag aggaggtgat ctgggccagc agttgaagca 120 

cattgaaacg aagaagaagg ctgacttctc aggagctgcc tggatgctgg cctcctgggg 180 

aactggaact ccagtttgaa ctgaaattcc ctgtatactt gtcaggaaca tccactggac 240 

tgtgggttcc ttggtacaaa aactaagtat ccccatgcct gccacagtgc ctggagcaga 3 00 

acagacactc aaatatttaa taacgtatga ctgattgtgt attacccgcg gcatcaatag 3 60 

aagacacaca ggggggngga ggataaattt gggttaaaaa anaaggctaa atctgntggt 420 

gntgcttcac atganaatga nagtctttcg gtttatggtg gctccccggc caaaacacc 479 

<210> 330 
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<211> 171 
<212> DNA 
<213> Homo sapiens 

<400> 330 

gaattcatga cactgaagct acccaacttc taccatgcct ataacatgat caccctagga 
agtggcagag taacccgagg gaagaagcct ggatacctga atgactatat gaaacacagn 
tgccttaata ccctcgatca ctcactacgg aactctgtaa taaagtatat t 

<210> 331 
<211> 251 
<212> DNA 
<213> Homo sapiens 

<400> 331 

atgctatcta tacttatgtt aagcatcttc agagacacca tggatgatct tcattctgaa 
tcccaggaag aattctggaa agcaatcacc tacctcttga tattttctcc gtcagatatt 
acctaaagat ctttttggga cctggagaaa agggaaggta gaactgattg ataacttcta 
tttatataga attaaaagaa tatgaaaagt ttagataaag gagcataaat aaaaaccttc 
tactggcaaa c 

<210> 332 
<211> 446 
<212> DNA 
<213> Homo sapiens 

<400> 332 

gttgtctgcc aacgctaact ggccagctct gacaggaggt gcgtggccca ggaggagcca 
tcaggccagt tctctgggat actgctgtgt ctccagctct gcagtttgct ctgcgtcact 
cagcggcaga cggagaggca gacacgagcc ccttgtgagc cctcctcctt accgtcatct 
cacaatgctc tgaaataagg aggcaaatgg ctgaggtccc ctcagttgaa gatgtgattg 
agttctatct accagaagca tatgcctcct ggaagcctgg ttctaacacc tctcacaaaa 
tccttcaagc acttttttct gttccaaggt ttgcttatgg gggacccnaa ggaaagggct 
tnananccct aaagatttgc tgagtcatat gaggggccag caaacttttc ctgtaaaggg 
tcagataata aacattttaa gctttg 

<210> 333 
<211> 498 
<212> DNA 
<213> Homo sapiens 

<400> 333 

gtgttgatca tgaaacattt tcaaccaaaa atagtagagc caaatttgag cattgccaac 

ctccacccac ctcccttcat cacatggatt tgttccaaac aacttctggc ccttcaagca 

aggaaacact ccttcaaaag atgaacactt gccatcacta acattgtgcc acaggctctt 

aagacaattt caaatggaaa tgcaacgaag ttttgctaat ggtagcatca ctgaaataag 

tgtagtgtct caaaagactc ctatgtgatg gtgaagaatt aagtgtgtat gtttaggcac 

aagttttatt tttcaaagaa tatttcatct tgctatttgn cgaatgaaat cttaaggaat 

aaaaagngnc ttaagttttt ccaaattgca aaaaggaatt accatcttcc cactgactcc 

atgaatgcca aagtcactga aaactaagct taatgactgt tgaatcaatt tccaaagatg 
taaaattctg ctttaata 



<210> 334 
<211> 345 
<212> DNA 
<213> Homo sapiens 

<400> 334 
gcaaaataca tgggaaaaac aaaacaaaac 
tgagctcatt ttctgggtga atgcttccct 
caggaagtta gagaattgaa aaatagagaa 
ctgactaagt gggagactgg ataaaagcct 



agtgaaccaa gaactcaagg gagaatcttt 
cttacccgac caccagaaca gaggagcttc 
aaagaatgag tcacaagagg atcttatcat 
tgtaaaatca ttgcagctta tatacatgtg 



tatggttatc aagtagcatt ctatttctca aattaagcat ataccgcant tattttgtga 300 
gactataaan ttcttctaga aagaaataaa gaacattaaa attct 345 

<210> 335 

<211> 297 

<212> DNA 

<213> Homo sapiens 

<400> 335 

aggacttgct cagaacaagg gaagaagatg actatgcagc tgctcggtaa cagcgtctag 60 

tcacactctg agatactgag gtcagcaaga acagaggatg cacactatgt cccatcttgc 120 

ctttctgccc agaaagtctc agttactgga aaagcttcag aaatatttac caaaaaatcc 180 

atttgaaatc ctgaaattct acttctcaga aaaacagtat tactcttgtc tagaaataac 240 

attcaggcct caaagtgcta tactgtcatt acttctaaaa ataaactgag caaatcc 297 

<210> 336 
<211> 175 
<212> DNA 
<213> Homo sapiens 

<400> 336 

tattgtttct aaagaaacta tgaagcaatt caaccagagg agaacaacta ctgtgggact 60 

gcagatgatc ttagcctgga agctgcataa ccctcctacc agatcaaatc attcagcatc 120 

catcttaaat gagaaattta agtaactaaa aataataaat ataaataatt aaaat 175 

<210> 337 
<211> 496 
<212> DNA 
<213> Homo sapiens 

<400> 337 

attcaagaga gtgccaaagg aaacaacagg acagaaggag acatgaggaa gagatgggac 60 

agacagcact caaccctgag cagacgtgag gggcaaaaga aaaggcaaca ttaaggaccc 120 

attcaagttt caagtctcag cgtcccagag gatggtgagg atacagcaaa aatggagagt 180 

gcaaaaggag aaaggcagtt gaatgtgaag ataacggggt cttcggggcc tacctactaa 240 

gtctggtggg ataaccctgt taaatgggaa gagggaggcc tttcttggta cattttagga 300 

ggaaaaaaat ggctgcctgg aaagttcata taccagcagc aaaaagaaaa gcnnaatggg 3 60 

attaaaaaat nttaaaagcc cttcacnagg aggttaagtt ntggcgggtg tgcccatcag 420 

agaccagcag agacaactgg ctctccggcc tgagttcgcc tacatcagaa ctagcacatc 480 

tctctgtcta atttct 496 

<210> 338 
<211> 371 
<212> DNA 
<213> Homo sapiens 

<400> 338 

gtggtcaaat gtgtgggagt aaaatgtgtg tttgaaatgc cttcccagga ctcagtatgg 60 

ctcattttcc tccttgccat gagctgcatg tccccatgat tcggggcagc ccgcctaggt 120 

gcctgttcct ggctatcaga agagcacagt gaagtcctcc tgcccctgag aagatcgaag 180 

actctgctgt ggtcaaggtt ccttctccag ccatatgtgt tgtctaggat tagacttttc 240 

aaacagtggc caggccttct gaggtcacat gtagcagtaa aagcaagctg tggccaaaaa 3 00 

aaaaaaggnc ngnggggncn attnannttg gacttaancn gggngnactt nntnaaaagg 3 60 

gggggactcc c 37 l 

<210> 339 
<211> 479 
<212> DNA 
<213> Homo sapiens 

<400> 339 

actgaggatc ttctgaattg gcggcctcta catatgcttc tgctaaggag catgtattca 60 
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ctcaacaagc attttaaacc cccagcaagg 
gaagagatga ggtggctcct gcttcccaaa 
gacttctcat aacctggaac atcttttctc 
ccaaatccaa ggaccctgcc tgcccgatgg 
gccacgaggc ttcatctacc tccaatagtc 
nntgcnttgt tttcccagga aagccttnct 
ccatgaacat tcacaccctg tacctctctt 



cacaagctac 
gagtgtggtc 
gaggccaaag 
gtgctcctct 
acggagtctc 
tgacctttca 
ttcacagcac 



aagggttaca 
cagggaaagg 
aggtgatccc 
gctgagcagc 
tccatgtgcc 
gatcaagtca 
ttatcccaag 



agagacacaa 
aataggcctg 
aagtgagagg 
caaaggcagt 
nnttgggttt 
catccacgta 
agaaactcc 



120 
180 
240 
300 
360 
420 
479 



<210> 340 
<211> 481 
<212> DNA 

<213> Homo sapiens 
<400> 340 

cagagtgtgg gaccaaggac aaattacaga agcacagcag agaaggttgc ccggttcccc 60 
gtttgcctat gaagttatgt agtgagcaaa taagaggaca ctggagcaca gcgctgctta 120 
gagccgaggc tcagtaaact tttgttcact gatgaatgaa tgtattaagc tgaccagctc 180 
aatttgattc ataaagaaat agccttaggg cttttctgag gaagaacaca acatactttc 240 
aatccaactt tttaaaaaat aaaacatgat tacacactcc taaataaata ttttcagaaa 300 
gtttgcctat atgtcaaaga tttctaggat ttggaagcca gtatgttcgc aagttgtgag 360 
gacatctgng ttattctcaa cacttccttg gcaaaacnan ngngtcctta cctgaaagcc 420 
tgaaacaata taaaatgcaa agctgacatc cccctgcctc ggcaactgca ctttcaccca 480 



<210> 341 
<211> 306 
<212> DNA 

<213> Homo sapiens 
<400> 341 

aaggaaagat ggaaaagagg agttatcatt tctttctcaa gatcctggcc ccatgagcct 60 
cagtgtagcc ctagttcctg ggatcagcac caacaggcag ggaggagagg ctctggcgcc 120 
ctgcagacag caccaggttc ttggcatcag gagctggata cagagtccct gataatccca 180 
gccacagaat atttcaaact caccgacatg tcctctaaat atcagatatg aaaaggcttc 240 
cactcttgca cctgtcttgc tattatttta cagatgtgtt ctaaaagcta taaagacgga 300 
aatcac 306 

<210> 342 
<211> 471 
<212> DNA 
<213> Homo sapiens 

<400> 342 

ataatacaga catgtacccc accacacaca atgtaaactg caaaagcaaa aaaccgagat 60 

gcctcgtcca cagttcaacc ctctgcgaac agagccatcc tggataaaag ggctgctgtc 120 

atgattgcca taaactgagt ggcctgaaac aacagagtca gaaatcaagg catctgcagg 180 

gccatgctgt ctccgaaggc tcggaatatg gacccctcct tgcctcctcc tagactctgg 240 

gcaggctgca gatccagaaa gccgaagctg cagcaagtcg gaaggcgcgc cgcaggagga 300 

gttccttcct caggagactg cagtctttgc tcttacggnc tttgaaaaan atggnatnaa 360 

nccccccacn ctatggaggg taacccgctg cattcaaagt ctacagattt aactattaat 420 

catatctaaa aaacagcctc acagaaacac cagactggtg tttgaacaaa a 471 

<210> 343 
<211> 463 
<212> DNA 

<213> Homo sapiens 
<400> 343 

catgtctttg cagctcttct caccaagaat tggagtctat tttctcaact cattaaatct 60 
gagctggctg tgtgacttgc tttggccaaa aagactttag caaataagat ataagcacaa 120 
gcagaggttt gaaaagtgct ggttcgctgg ggcttactgt attactgctc ttgaaatgct 180 
gagatgacca tgtgaatgaa tccaaggaag cctcctggaa gatgagaatg ctgcatagaa 240 



g 



481 
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gaaaacagag gtctccagct gacagcctgc 
tggatcatct tgtcaccagc tgacctccca 
aaagatgagc tgagccagtc cagtgtaaaa 
agctgaataa aanggttgtt ttaagccaaa 



caaacactag aaatgtgaat gaggccattc 
gctgactatc agtgcatgag caaacccaga 
aaaatggccc agccanccca cagaataatg 
aaaaaaaagg gcc 



300 
360 
420 
463 



<210> 344 
<211> 149 
<212> DNA 
<213> Homo sapiens 

<400> 344 

gagtggaagc agcctgaggc cctcatccaa tgcagatgtc tgtgccgtgc gtcttgtcca 60 

gcctgcagaa ccatgagcca aataaacctc ttttcactac ccaaaaaaaa aaaggncagn 120 

gnggccaatt cagnttggac ttaaccagg 149 

<210> 345 
<211> 407 
<212> DNA 

<213> Homo sapiens 
<400> 345 

tatatgaaga aatctggcct cccacagaga cggatttgga aacaagagga ctacacagac 60 
cctctgacag tctcttgggg gacacaatgg cttgccaagg gatccttgat acacacttcg 120 
agaaccactt gcatagacca tcaccatcat cctggaaggt tttttcaaaa aagaccacta 180 
ctctnacttt cttnaanaat aacattgcct tttcttgatc ttnatggatg gggaatcatn 240 
antgacntgc tnntttgaaa taaaggacnt ttgaaaatan aaacntggac ctatgaanat 3 00 

atnaatcgga tgaagattct gaagngccct gatgntacta tttatggnct gnttaaatat 3 60 

tccaacttaa tgggaaggcc ctnggggggg gatttggcca cccttgg 407 

<210> 346 
<211> 363 
<212> DNA 
<213> Homo sapiens 

<400> 346 

gatgctgtct tctgatgaaa acagaatcan gaatgagtga aacatggaag tttgaaaaga 60 

gtgaacatca acactggaaa ctcaagagtg tgttaaacag agaaaattaa tagaaaccag 120 

gaaacactta aggtntattt gaagtttgtt gtcttgaatt gatgtattaa ttaactctgg 180 

aatcaattta ctgtatttgg tgaacccagc tttcagtgga gttcttctta attttcgcct 240 

actgttctac ttgttccaaa tgtgtgtatc atgtattttt tcttttagat ttttctacct 3 00 

aattagcttt gattctgtca tcaggattga ttttggctaa aataaaacac atatatgtct 3 60 



<210> 347 
<211> 383 
<212> DNA 

<213> Homo sapiens 
<400> 347 

gacttgctgt gctcagatcc tccattcaag agagctacag acacgggggt gctggtgagc 60 
aggagccgag accatctggg gtgggaccga ccaagagttt gaggtgtcca gggggtgacn 120 
gtgaagatga cctatcgcag agggtccctt ctcattcacg ctctgaagtc tgcacagggg 180 
caggggctac cgtgctccat tcagtttggc ctctgttgta tcagccagag gccagcagaa 240 
ctctatggtc actcccccgt gtcacggaca atttgccacc tccaccggca gcccagggct 300 
ctgcctgaat attctcgcct gatcgtagga ttgtggggag ggatattctc attgatctct 360 
aaagaaaata ttggtcgctt ttt 383 

<210> 348 
<211> 479 
<212> DNA 
<213> Homo sapiens 



ttt 



363 
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<400> 348 

gatgatcatt cttgaaacca gatcccatat caagagaaag tcaagtaatc atgaagagag 60 

gccacatgaa ggtgttctgg ccagcagtgc cagctgaatc tcagttgcaa gccagcatga 120 

ccaccagaca gggaagtgag caaaccttca acggaggcaa gccccagcct tcaaaccacc 180 

ccagccgatg catggggcaa ggacgagcca ccactggcaa atgtgcccaa actgcaggtt 240 

caggaggaaa ataaatgatg gtggtgtttc cagtcattaa gttttatggt ggtttttaag 300 

gcaaccaaag acaactaaga acatttactc tggccaataa aaaaatgaat gaaagtgatg 360 

tgtcacttcc atgtggaaag ngttcattcg ccagtagtta agacattgga agcaagcttt 420 

tccttcttgg tgcaccaatt angaaaagaa gtggtgttgg gggatgtgcc ctccttcat 479 



<210> 349 
<211> 614 
<212> DNA 
<213> Homo 



sapiens 



<400> 349 

cagaaactga gcccaggctc taccgacctt taaactacaa cagagctttt naggagaaat 60 

gcggaagaga cggcntttcc accccgggac cttaccagaa aacccgcaca ccccagncac 120 

aattggcttc cttcattcaa gccnagaaaa agggactccn acttttcacc accaggggan 180 

gccccccttt cttggtggct tgggccaant tgcaaaaagg cctngtttca ttgggcattn 240 

ccacaagggt ngggggggaa nttgggnccc ccaacccttc ctttcttang cacctttnan 300 

aagnggttnc cttnttgttg ggcaagnaac aacccaattg gcnttaaggg ttttcntctt 3 60 

ttttnccaaa cttnccttgt ttngggtctt ggggcnnaag gtggnaccgg aatcaattct 420 

tttccacttt gccatttaaa ttnaagtnaa gttcaacccc ngaaacaatt tccttaatac 480 

cttggggccc cccccaaatt tncttttttt aaaanaaacc aaagtttggg cctntccccc 540 

ccacttgggg aaatttattt tctaaaatat tngggaacnt tagaaattaa aaanttggaa 600 

gaaactttgg cccc 614 

<210> 350 
<211> 380 
<212> DNA 
<213> Homo sapiens 



<400> 350 

ataacatgtt tcaaagtggc aaatttcccc taagaattgg aaaaatggat aatacggatt 60 

ggggttggag agcccgggat tctgattaaa catggaatct gagaactggc agaaagcctg 120 

gaactgatgg aagagagggc tctagggcct ccatactaaa tggtgaacta ggaactataa 180 

aagagataat gtggtgaaga gcttcagcca tcaagttatt ctaaaaatga agtagggcat 240 

tttatatgtg gagagaaggg cactgattat tatctgacta ttgctaatat gtcccataga 3 00 

acttatttgg aataattttt tactattaat ttgaacaaca gcagtgagac tctttatatg 360 

tataataaag ctaattttac 3 80 

<210> 351 
<211> 373 
<212> DNA 
<213> Homo sapiens 



<400> 351 

gtcagatttc ctgcaaggag gatctacagg ggcccagcac taccttgaag gccgtgaaca - 60 

gccacagagg gaaagccgcc ttgagtatgg agcaagactt cctcagacag gtctcatttg 120 

tgtcttccct tccagcagga ggaagacagc acctgcccag agtagtttta gagggcactg 180 

cactaaagaa ggagaactgc aggggaagat cgtgccctaa tggatgaaac atttcccaaa 240 

tggcctggct atctggagag atgaggactt gctcattagt agaagtttcc aggcaaagcc 300 

tggataagca tttgctgcag gggtggggga aggtgaaggt tganangana nctctaagat 360 

ttctttgcct tgg 373 

<210> 352 
<211> 405 
<212> DNA 
<213> Homo sapiens 



<400> 352 
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gctataaaga cgccttgaat cctcctccac gatacccgcc ccactatttg ttggcacagc 60 

tacgatgctg cttatggatt gttttcactc ctaaagacag tggcgcaagg caaggtgacc 12 0 

tggagcgagg ccatcctgag tgcccaccca gcgtcccagg agcctgttgg aatttggaag 180 

gacatttgcc tctgtttata aagactggct ttttgctgaa agccagggtc tcaaaaattt 240 

tgttttatta atagaagcta aaccccaaac atttggctct ttttcattcc atttcccctt 300 

tcacaatctt aactattccc aagacaatgg atacctctgc ctgtafccaag ggcngattgt 360 

caataanaaa gtcaacagga aataaacntt ntttttttca aaatt 405 

<210> 353 
<211> 464 
<212> DNA 
<213> Homo sapiens 

<400> 353 

ctgatttaag ttanttccng gggnccnaaa cctngnaaag gtttttnatt agggcagcaa 60 

agggaaccgg ggaaccactg angaggagca gcagaaaact tcacagcttc tttgggtggg 120 

cagcagactt cagatttact ggaagccaag aaaggggaag acagcagcag gagggcttga 180 

ccagctagct aaataagtta agccatggaa agaagcagaa gaaggaagct caagaaatct 240 

cagcaacaaa cactcatgga cttttttcta aaaatggaaa tttaaaactt tctcgaccat 300 

gacccacaag aaatacattt tacacgttgc atccaggaca tagcaatatg cctgtgagcc 360 

actttgtggg tgaagggttt ncatggtgag cttgtttaag ggaacatggc ccccnggggt 420 

nctccttttg gagattcccc ctggatttac tggatcaaag tctt 464 

<210> 354 
<211> 446 
<212> DNA 
<213> Homo sapiens 

<400> 354 

ggaaatgcca caagactatg gccgtgcaac atttccgcag tgctcctcgc tacaaagaca 60 

ttcccctaag gctggtgggc aactcaacac tcagctcagt acgtggtcag ctcgtcctcc 120 

ataggagcct tatgccttgg tgaggagatc tctgaagaaa ttgctgatga aagtccaaca 180 

ggctcttcca gtttgtctgg tcggtcacat ttgctgaaac ctggaggaat tgttagtgga 240 

agctcaacag gcctgactca gtctgactgt ccattcttct ggaagctgca gagaaaagaa 300 

acctggaaac cctatatgct gacaaaaagg gacacaattg gatatgatgg ttattttacc 360 

aaggttttga aatgtcgtgc tttcaaatat aaacagactg ctttaangga tcnaaagtgg 420 

ccttttaaag ccaataaaag ccctgc 446 

<210> 355 
<211> 446 
<212> DNA 
<213> Homo sapiens 

<400> 355 

cagcccagac gtggtcaaca agaacactga gcagaaaaac aaccttgagg atgaaaacag 60 

ggatgttctc agttgaagcc cacactagaa gagctattta aacagcacca aagtgctggg 120 

attacaggtg tgaaccgctg tgcctgaccc agtgtfctcta aaatatctac aaaaacagtt 180 

tggagttagt cctaggcaat gctttgctgg aaatgggatg tgtgatggac cattctaagg 240 

gagctgaact ggctgctgtg aagacatcag gaacccaagt gagactgtgg tacgtaagtc 300 

aggaagaagg cacttgcctg gttttgaaaa catgtcctgg ggatggntag tgcctncagt 3 60 

tcacaaaaaa agcaagctgc cttgttaggg nanggannca accanttgaa aacacctcca 420 

ntactgccan tanaaacagt tgattt 446 

<210> 356 
<211> 450 
<212> DNA 
<213> Homo sapiens 

<400> 356 

aggctgagaa gtccaagatc gagggatctg gcagcagatg agggcctttt tgttgcacca 60 

gcccgtggca gagggtggaa gggcaagagg acaagaaaga ataataaatc aaacttacag 12 0 

cctcaagctc ttttataacc agcatcaatc cattcatgag gatggaacac tcatgaccta 180 
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aacacctccc tttaggctcc accttccaac atttgttaaa ttggggatta agtttctaac 240 

acatgatttt gggcgggata cattcagatc agaccaaaag ggcaaaggga ttttgtatac 300 

acagagaaga agttgatgtg aagatggagc agagagccgc ttgaagatgc tagccttgcg 360 

actggagtca tatggctaca atccaatgga tgctggtaac cnccaaaana tgganggngc 420 

ccggacnaaa attcncncct ggaacctcca 450 

<210> 357 
<211> 460 
<212> DNA 
<213> Homo sapiens 

<400> 357 

gtccttccag aagagcactc cccatcaacc cgcgggcagc tgaattccca cctcagactc 60 

tgctccaagg gcgccgtgtc tacggaggcg acgctgagga tggcttatca ggttgggtca 120 

ctcaccccac cacgaggacc tgaccttaaa ttctcggtgc atcctaagtg tgacccagag 180 

accgcctgcg tcagaagcac ctagaatgct gtggaagcac cttcaatgca gattcctggg 240 

cccaaccctg gttccactga atcggggtca gctggtgggc ccaggaattg gcattttcaa 300 

cagcttccaa ttgtacacca gaatactcaa gcttgtgact cccctgctca ctgntttctt 3 60 

catcctttct cacttcctgc tgagtacata tgnattttac tacttttaaa aganactttt 420 

accaataaag gccggcnttg aaggggaaaa aaaaaagcca 460 

<210> 358 
<211> 419 
<212> DNA 
<213> Homo sapiens 

<400> 358 

gaccgcaatg ctcctacgat gatcctgtaa cagaggtatc ggacaccaac cntgggannc 60 

ctccttcaaa ttatgggaca tcaccaacaa tcaatcacta agagaagaaa taatttagaa 120 

gaagaattca tttttggtta ctcaaatata acccaattta aaggagactg ttatttctct 180 

tctctagtaa gctacagaca ggatctgctc cctttaataa gatgcttggt taataacatt 240 

tatttacaga gtaaaatttt ctctttattt ccctccacac taaaatattt acataaactc 3 00 

aaaccactta tgttgcctat tccaaccagt ttcttgtcag agtgagtagg aaaattcttc 360 

attaaatgtc attgcctttg gggnaaacag aacataaatt aaaaaccccg ctttattta 419 

<210> 359 
<211> 455 
<212> DNA 
<213> Homo sapiens 

<400> 359 

gccaagagat gcaaaggatt aatcatgaac ccagttgccc agaggtggaa aaaaaaaatc 6 0 

tgttgtggta gactgaagaa gcnagaagtt atatgaacac caagaggccg gcaacatgag 120 

tgtggcctga gtctgacgcc ttcgcccacc ctcttccaga tcacctgatc cgaaagaagt 180 

tacgaaaata gctcanaatc tgggcctgcc tggaagagac ataaagattc atttacatgg 240 

gaaggtgact gctctgaata tccacagacg acgaatctat gctaatggtt cagtctccca 300 

caaatctggg atttatataa ctggctccta cccttgttcc ttgccagcag aaatgcttga 3 60 

attatcttaa ttccagaatg naaattattc ccattctgan ggcntcattt ttaagctggc 420 

aaaggncatt tttttnacag gcctaaaaaa aaatt 455 

<210> 360 
<211> 465 
<212> DNA 
<213> Homo sapiens 

<400> 360 

atgatgtcag aagtggggtg caaagtanag gcttctgaca acccccggga gtactgagtg 60 

aacaagcaag gtatctgcag aacccacttg tgtccaccga tctctcagag tgcctggaga 120 

tcatggacaa cagaatgcag tgtgagggat gtcaagtcat ctgggaacaa cactttctta 180 

agaattcatc tcaatttctg cgttttttgg aaaggtcctt aattgtttgc tgcctctgca 240 

agctagacat ctctttcagc aaatggagac ccagatggtg aggcaagaga aggaatgacc 300 

aaattaatga aaatgttctt tcagcttgtt attgagcttg ntattctcct gaatgcttgc 3 60 
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tctgcgactg ntatgctaac tgaccctgtg ggtaaaanga gaaaggaata tctcntttgg 
ttaatttaaa aaatantaat aattgacaaa aaaaaaaggc ccccg 



420 
465 



<210> 361 
<211> 332 
<212> DNA 
<213> Homo sapiens 

<400> 361 

gctgtaggat gacgcatgat gcaagtctga agttgtatgt ggccatcttt gccaccacat 60 

tcagaaagct tacctgagaa tgaagtcaac actggagaga aagagaaaga aagagggaga 120 

acatatcaga atctctccac aatggcaaca aagatggtca ctagcaagtc caagcctcca 180 

ttctctttaa aacttgcaat ccttgaggac aaagaaaaac gatctttttt tccaatatct 240 

atgttacttc taaaagaagg nattaaggaa agcctgnatg aaatttcatt catnantcaa 300 

gaccatactg gccttgaata aaatttataa gc 332 

<210> 362 
<211> 293 
<212> DNA 
<213> Homo sapiens 

<400> 362 

ggagatcggt tggaaagaca gtggactgat ccaagagccc agtcttgatc agcccagact 6 0 

gaggggacct taagagatgg gaagactgac atttacaact tccccaactg gccgtgatga 120 

tcttaagtac agccactgag gaagccaact taagaatctc ttcctgaccc tgctcagaat 180 

tctatcatcc ttcttcctgc cccaaataaa attcccactt ccacaaaaaa aaaaggccan 240 

cgnggccaat tcagcttgga cttaaccagg ntgaacttgt tcaaaagggg ggg 293 

<210> 363 
<211> 466 
<212> DNA 
<213> Homo sapiens 

<400> 363 

ttgtgcgtca ctgcaagact gcatggtaat gaagccaagg cactgtgggc caaaactctg 60 

ctgcctgtga gaagagaagg gacagcggct tggagagaca gaacggcaaa accgctgctg 120 

ctgctgcttc tgcttctgct gctgctgctg ctgctgctgc ntttgcagct gattgagaca 180 

ctatgttgag tctacaggat tctgtgtttt ttgaaattag cataaagtcc ttgttaaagt 240 

cctggagcag cagctgaagc caagtaggct gcccaggcag tcagaagaac agagcaggtg 300 

aagctgcaca gcatgcagtg gtgtgtcttc ttttggggcc aagcctgatg caacttacta 3 60 

tttgccaacc cccggtcatc ttccttctga gtaaatggcn ccactatcct atgagtgatt 42 0 

caagtaaaaa tgctcttcag cgccagtcag caaagtaaat aaatca 466 

<210> 364 
<211> 283 
<212> DNA 
<213> Homo sapiens 

<400> 364 

tcacgaacaa tctggatttc atgtcacaag aggaaacaga gtcatcactt caagtactgc 60 

accaatcaag tctgttctgg taataatgtg aggcatgcct caagacctcg atacatgaaa 120 

gcaattactg cagatgcctg gctgttggca ctgttcagct ttaatgtagc agtacagaaa 180 

gttatgcctt ccacctgtga tgactgatcc tagaacctgc agacaatgag tctaagctga 240 

atacaaacaa taattatcca agtaaagagc ccttgttcaa ttc 283 

<210> 365 
<211> 407 
<212> DNA 
<213> Homo sapiens 

<400> 365 

aaatgaagat ggcatatgga aaggcgattc ttatactcag aaggaaaagt tcccatggaa 60 
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gccatggatt cattcatgac aaagtgggtg gcctgtttgt ttgcttgaga ttggcaaaaa 120 

tccaaaatgt ctgtgcacac tgctggtgag gctatggtaa aacaattaca tatttctggt 180 

tggtgtgtcc ttgtgaagtg aaatttggca gtaagtaaca aaattactca tgcatttccc 240 

acggatcagc atctccactt gacataaaat aaatgctaga gatacacatc tacaggtatg 3 00 

aactacaagt tctgtagtat acaaggatac aggtaattta ttctgttgtc tatgatggca 3 60 

taaacagctt aaagtgctta ttaataaggg gcctgggttt gttaaag 407 

<210> 366 
<211> 466 
<212> DNA 
<213> Homo sapiens 

<400> 366 

agcatgctgg acagcctgga gctggagccc acctacaacc ccttgcatgt tcaaagccac 60 

ctgtactcac acctgagcag catctatgcc aagcctcagg ggcggctcca cccacactgg 120 

gagagccgag ctccgagaaa gcatccctgc aagactgggc agttgcagac caaccgagct 180 

cgagctactg tggcccccct gcctatgact cctgtcccag gcagagcctc caagatgcca 240 

gcagccagca aatcttcctc agatgccttc ttcctgcctt cagagtggga gaaggatccc 3 00 

tcaaggccct aagtcaccag caccagagcc cagctgccca gcttaaccat attcatgctc 3 60 

aggttcacat aatgggctat ttgnggtcaa gacttgcttt tttttccccn ggganccttt 420 

tntgngggag ggnttnattg ggaaaanaaa nagcctttcc ttgtcc 466 

<210> 367 
<211> 475 
<212> DNA 
<213> Homo sapiens 

<400> 367 

ccattcccaa atgcgttacg taggtggaag ctgggtgagt gtcaggaaac taaactctgc 60 

aaaataagat gacaccctct tggaagattc ggaaaagtgt atcagacttc aagagccagc 120 

tcagctacta cttcaagcta acctttcttg agacctcccc tttacctgct ttcatctgtg 180 

ctgcccgttg acttaactga atcacctagt ggactgaatc tggccaaact ccagggccac 240 

ctatcatgag cagccttgtt tgctggcaat ttgcagagtt gcaaggggta aaggactggc 300 

tttgactatt cagtctttca gttcatcaca tcttgcctgg atgactgcag tggccactaa 3 60 

gctggtcaca gagtgagctt tcttaaatgc aagtgtnaag gatngnnaaa ccctcaaggg 420 

gctttnantt tttccaaggg ccctgtncct tggaggggca taccattgaa gggta 475 

<210> 368 
<211> 466 
<212> DNA 
<213> Homo sapiens 

<400> 368 

ggctgggacg atgaaatgtg atgggctggg aaactcaagc cngcccccag gtgggaatca 60 

ataaagggga ncgggtggtc tttggcttat tggtntggcc caagcctggg tcttcaaaac 120 

ctgggccctg gaaatcaaat ggctttccca ccctcaagct tggcccagaa gggaaacccg 180 

ggggaattac cagggccctt gaancccact ggcaggccca gcccaggtnt tggttaattt 240 

tttaaatggt aaaaattctt taantaaaaa caaacctcaa gggaagctct ctttgtcncn 300 

ttttaaaaan cccattttna aactttcttg cttaaatccg ggaagnngta atatttcaag 360 

nggcaaactt ttggaattct tgtggcctcn cttggggaat gccaattccc ttcaaagcct 420 

tgggcnccca aaaataaaag gtcttcccgc ttgattattt aaaacc 466 

<210> 369 
<211> 475 
<212> DNA 

<213> Homo sapiens 
<400> 369 

aagccaaaga ttttgcagaa tcaaggatgg atggagtatc aaaataagga acggaaaaaa 60 

ctgaagatat actaaggatt aaggcccagg ttcatctagt gtccccaggt gccaggcatt 12 0 

gtgctgtgac tgtgatgtga aaaaagaggc caggacaact gggtctcatt cagtcagact 180 

ggagtgcagt ggtgtgatca cagctcatgc agccttgacc ttccagactc aaacaatcct 240 
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ttcatgtagc tgggaccaca ggtgcatgcc accatgatca gtttattttt aaattttttg 3 00 

tagtgagcca ttgagtccag cataatcctt ctaatttagt tccttatctg aaaagcgagg 360 

acattgtgac aatgatctca gaacactgtt gngaaaanta aantctnaan ataaagggtn 420 

ggggcccaaa aggctttaat tggaagttgg cttaanctat aaaaaaaaaa gggta 475 

<210> 370 
<211> 387 
<212> DNA 

<213> Homo sapiens 
<400> 370 

ccctgaagga ggtgctccag cggcctgctc cgtcctgtcg gaggcttcct gaaggcctgt 60 
gttctcacct gcccttagtg gaaaccttct attcatctga tctattttct tgtgggtgtc 120 
agggcccaca tgtctccatc tccctttcca gctccaagat atctgttatg ggctgcattg 180 
tatctccaca aaattcatat gttgaagctg atatgatttg gacctgtgtt cctgcccaaa 24 0 

tcccatgtca aacgccatgt gatgtgtgct ttccctttgc cttctgcatg attgaaagtt 300 
tcctgaggcc tccccagaac caagaagatg ccgcatgctt cctgacagtc ttcaaaacga 3 60 

tgtgtcaatt aaatctcttc tctttac 387 

<210> 371 
<211> 462 
<212> DNA 

<213> Homo sapiens 
<400> 371 

gctggagtgc cgnggcacga tcttaactta ntngnaccnt tngcntccng ggttnaacca 60 
nttttcctgt ctcagcttcc ccagtagctg gggattacag cgccctctgg taggcattgc 120 
agagagaaga atgcaaatta aataagaaaa gccctctgcc cttcaggagc ttttggtgaa 180 
gatctctttt ttaaaaagct gcaagactgc tgcccgaagt gggacacaca acctaaataa 240 
gggcgagaac cggcaaggac ggcccagcca cgtggaaccg cctcgcaact ttggcgagca 300 
acttgagatc ttctagagac ccaggagtat gttgcttcta cctcagactg gggagagggg 360 
agcttcccca aaccattggn gggagatgaa natntcaacc anccgaattc ctgttcacga 420 
ccaacctgtt gtgagctctt ctgggggatc aacaatggct ga 462 

<210> 372 
<211> 263 
<212> DNA 

<213> Homo sapiens 
<400> 372 

ttttctntat gaaaactact nntcacantc nnantccttt nangttaaaa antnaaaggg 60 
naggccagnc ccgggggttc acccntgtan tcccagcact ttggaaggcc aaagcaggtg 12 0 

gatcactgga ggctactttt tgttccttca atgcctattc attcgtctcc tctactcccc 180 
gcttcccctt ccttcataca ccaactcaga gttcgaggca cctgcccatt tccttccaaa 240 
taaaactgta aagaggttac aat 263 

<210> 373 
<211> 230 
<212> DNA 

<213> Homo sapiens 
<400> 373 

gaagtcaagt tgattacttg gcatcagccc ttcatcacag atactactga aataaaaaac 60 
caagggaatg tgaaaaaaac ggaaggacac tgaagcccgt ggggaaataa tgaagtataa 120 
gtgcttcaga gagcagcaag aaatggaata atatttcttc tgtgaggacc tcagtaataa 180 
caacccatga gtgatgggac ttattgcaaa tggcaagagt gctgttggag 23 0 

<210> 374 
<211> 338 
<212> DNA 

<213> Homo sapiens 
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<400> 374 
ncatngtnng ggagttgntg naaccactgn 
tggtcctcna cactgggtgg ggagccctac 
tcggnntctg cntatcacag ccatacaatg 
tgctcagggc ttaccaaacn gtgacnatag 
nttcggnatg atctgacgga anggccggca 
acctggcaca tgtctggaac ttcncattac 

<210> 375 
<211> 412 
<212> DNA 
<213> Homo sapiens 

<400> 375 
caacctcgaa aatgtccaac tgcaaagacc 
atggctgtaa cttccttggt gccgaggact 
gtcacttaac taaatgcaat cggcccagct 
accagcccct gagcttccgg acaggaaaca 
tcatgccagg ctccacaggc aagaatcaag 
gttccctgaa atcagcagct ggggagaatt 
tcattacatg tcctcttgca gcagcagcaa 

<210> 376 
<211> 416 
<212> DNA 
<213> Homo sapiens 

<400> 376 
ctcagggccc taggggagtc acaaaagatg 
agaagactct ctcaagaaca actgtcttgg 
ccactcagga ctccctaccc tgccccacaa 
acggaacagg aaggcgtgag ctttggcacc 
cagggccttt gattatagaa aaaaataggc 
agcagcatct aagaaaccag tttctttgtg 
tanggtgggg agggaggagg ggtgaaanng 

<210> 377 
<211> 253 
<212> DNA 

<213> Homo sapiens 
<400> 377 

tcaacagtca taactttttg aggacacatg tttattgctg ctgctggggg 
tgtacccact ttcaaatggg ctgtggaaga gacaaagctc atctggctgc 
gcatcctcat gcaagctggt ctactggtgg ctgcccctgt gacctgcttc 
ggcaggaaaa gtctcccact gtgttgcatt taaagaaaag aaaaagatga 
aagctctgca aac 

<210> 378 
<211> 303 
<212> DNA 
<213> Homo sapiens 

<400> 378 

gctgaaatga accaacatca gcagaggccg cggcagagtg 
agaagccctg gtctttgtct ccacaaatgc tgaaactgac 
agtctccatt agccaagcca agagcagagg aaatgttctc 
gtcgacacca gaaaatttcc tgctggaatt ctgccaaaga 
ggtccatcat tacggaactt tgctgtttgt aaatttaata 
aat 

<210> 379 
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ctgactcttc atancaccnc 
nttccatgaa gncttggcaa 
actcttcagg aggaaatacc 
gtgaggtgna gccagactag 
gaccctatat cctcagatgt 
agaggggg 



gcttttncct 60 
acngggtgga 12 0 

agcctagacc 180 
actnacacca 240 
gtccccatcc 300 
338 



catgtctaca aattgctgtc 
ccctgctcag ttcctactta 
gcaggcacca ctgctcgggc 
gcatctgcat ttccagactg 
cagatggaag ctacagagga 
tatcttacaa gggtggaatt 
aagtaataaa aaataagagc 



agccagagga 6 0 

cagtatctga 120 

cactataaga 180 

tagcagctca 240 

aacaaacagg 300 

cttgattctt 360 

cc 412 



aggacacgtg aagactacag 
attcccacag ctttcccctt 
gcctgcagat tctgagatga 
agtttaacgt agaactgtac 
ccattgtctt ggtgggtgga 
tccagtgatg agggcttagc 
naaacatact ttaataaaat 



ctgcaggcct 60 

tctgtggtca 120 

cctggaagga 180 

gggccaaaca 240 

accaaagcat 300 

cctaaaatat 360 

agatta 416 



cagctgctct 60 
tggggcagtg 120 
tgaatggcca 180 
attaagtaaa 240 
253 



agagagctgc 


ccatgctggg 


60 


agtgtttctc 


ccagagtcca 


120 


cactggagga 


aagaagaac t 


180 


atagctggcc 


gtcctaggga 


240 


aacgactcac 


atctgcttat 


300 



<211> 382 
<212> DNA 
<213> Homo sapiens 



<400> 379 

gtgtggagca gagaaaaggc tatacccact gatgaacagg gatccacacc tggggaagaa 60 

gcaagtatga ctttctctcc tgtggcttta cacaacctcc ttgaaattcc aagagcaacc 120 

ctcccagcta aagtcttctc agatgtgaca cgatctgcac aagcagaggc ggcacagggt 180 

ttggcttcca gttgggaaat gaagctccaa gggcagccct actatggcgg gctgtgtgac 240 

ctgggccaag ccccttgaca tctccagact cggcttccac atctgccacc accaggacac 300 

tggattgaat gttgggtacg ttgtaaggca agggagacac agaagtccta aaggcaataa 3 60 

agcttttccc cactgcccct cc 3 82 

<210> 380 
<211> 364 
<212> DNA 
<213> Homo sapiens 



<400> 380 

agactgggtc tcactacatt ggccaggccg gatttgaatt cctgggctca gcctcccgag 60 

tagctgggac tacaagcatg taccaccatg cccagttttc tgcagcagtt tttataaacc 120 

aaattttcca aattagaaag actgaccaaa gaagcacttt tatacgagga ataacttacg 18 0 

fcatggagaat ctcaacttgg accagtcaag accaactcca gcgatgaagc cagaatgtaa 240 

tatatctcaa aaggctaaag aagtccattt tcccagatgt aaattataat taaaaaatag 3 00 

tgagccaaac tctaatatcc caatgtgata atctttcaaa taaaaatatg ggctgtagtt 3 60 

cagg 354 

<210> 381 
<211> 318 
<212> DNA 
<213> Homo sapiens 



<400> 381 

aaatgttaag ggagttaatc ttctacaagt ccagtcatgt gctttcacaa agggccaaga 60 

aaggagtccc aaagctcgcc atgactcaac aggaagctct ttgtgtcttc ctttctacac 120 

catgtctgac aaagaagctg tcttaagttc atgggcctct gtctcttgcg tgaattctga 180 

agtcagtgaa gcaacaatga tgtcattgct tctgaagacc actgttggct gagataatga 240 

agatctcttc acccaaaaca ttgccatttc tgcagcatac atttcctacc ctttcaaata 300 

caaaagtatt ctaccgat 318 

<210> 382 
<211> 463 
<212> DNA 
<213> Homo sapiens 

<400> 382 

ccagcagaca tcaaggactt ctgaggagcc tggtaccttg cataggcact atggaccctg 60 

ttttgcttaa cccacccaac agccaatttt agcagacatc ctagttttgc aggtgagaag 120 

agctgaggta cgaagaagtt ttgttaattt ttccagttca cgtaacaagt aaatgggaaa 180 

ccaggatgaa aatcaaggtt tatctgtcgt cagactgtta ctcataatca ccattcggag 240 

agttcanatg tgggacaaga ttctaactcc nnccttctcc caaatggtta atntgccagg 300 

tgccctanag ctacatattg tcttatttgt gtgatnnact gannctgnct gaatnttana 360 

agccttgtat cttntgnant nncaaanaca naagagnccg nggggnntat ttaaattnga 420 

antnaaccgg cctgannngc cnaaaanggn ggggcttccc agg 453 

<210> 383 
<211> 220 
<212> DNA 
<213> Homo sapiens 

<400> 383 

gtggggtctt tcagtgagga cactcaagca gctctgtgga gaggaaccat cttgccagct 60 
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ccaacatgcc agccatgtga acaagcccag gtggcaaatc acccagcctc agtcaagctt 
tcagatgacc acagccccag ttgatatctg actgtaacca catgaaacac caaactctgg 
actcacagaa atcatgagat aataaacaat gattgttttg 



120 
180 
220 



<210> 384 
<211> 434 
<212> DNA 
<213> Homo sapiens 

<400> 384 

gcaaagaaac aaagaggaag gtgtggatgc tcacccagaa gtcttgtctc ctcgcagtcc 60 

cttagaagct caatcctcag gagacagtgc actgggggtt gccaagggga cctgaaatac 120 

cggtttgcca caatcctgac caaatcggct cccagggctg agaagggaga aggtgtcagt 180 

ccattcaaaa cccatcgtgg ctgattttga agtggaaaaa gaaaaaaaga agcaaagaaa 240 

agcattgctc agcaatgggc aggaagaaga gttaagaggc tgagctcttc ggcaagaaat 300 

gccatagctc tttcaacttg gacagagcca ggaccacagg ctggttgtgt caaaaactgg 360 

gtgttcttgc ttagtgcata aggtttggtg gttttcctcc ctctttcctt gagccctggc 420 

acttggggac cctg 434 

<210> 385 
<211> 268 
<212> DNA 
<213> Homo sapiens 

<400> 385 

attgtgaatg ccagcagaac agctgacccc aaacagcttg aagaccccca caacagaact 60 

gaatcagcat gaaaatgcag tttctccacc tctctgttcc atgacttcac cctgcactct 120 

tccaccaatc aatggtctcc acactttggt cgacaccaaa acgcttaaga acccaaccct 180 

agccccaaat tccttgggga gacagatttg aggagtcctc ttacctcttc atttggcagc 240 

cttaaaatta aaactctttc tttgcttc 268 

<210> 386 
<211> 542 
<212> DNA 
<213> Homo sapiens 

<400> 386 

gtgacatggc ttacaaggct acttgtaatc aacttctcat ggctcatccc catttgtgcc 60 

ctgaactcca aacgtactga gttacctgca gttcctgtaa tccagcatga ctttgtcctc 120 

caagcctttg ctgtccccac tcatccttca gttcctagct caggaatcat ctccatcaag 180 

gtttccctga cttctcccat ttcccaagtg aggcgttcag agagtcctgt gcttaccttt 24 0 

ggggtagcac ttacatcctg ctccctaact gtctgtagaa tcatctgtct tcgctgtctt 300 

tgagcaccct gagggcaggg actgcagctg ttatctgggt acatacaaca ccaaataaca 3 60 

atgcctaagg catgccagat attcaataaa tgtctgtgta agaagcaaat gtttaaacat 420 

ttccttcccc agcatgcctt ctctgactat ccccacctcc ttccagaagt actcacctaa 480 

tccatgcgga caccatagac caagtgcatt tataaaactg gtttataata ttaaatgggt 540 

ag 542 

<210> 387 
<211> 282 
<212> DNA 
<213> Homo sapiens 

<400> 387 

gtatantant tcttatangn nngnnnnnnn nnnnnnnnnn gggatgctcc ttcctggacc 60 

cagccaccca ctgggaaaag cctaagccac gtggagcanc tacatagaag agggccgggg 120 

ccacagctac agccagcagc tcctgccagc cacgtgagag agctaccttg atgttccagc 180 

ctccagagat ctaagagctt ccagacatct accaccccag ccacaccacc tgagccaatg 240 

tcccacagag tcatgggaga taataaaagg ctgttgttct ct 282 

<210> 388 
<211> 263 
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<212> DNA 

<213> Homo sapiens 



<400> 388 

aggcaagttc tccgttgccc aagctggcct ccaactcctg gctcaagtga tcctcccacc 6 0 

tcggcttccc caagagatgg ggttacaggc atgagccact gtgcctggcc tcacaagatg 120 

ttgttatctt tgttttacac tatcaatgcc catgcgtcct tacttaatta ttaaccactg 180 

tattgctgtt cattcttcct gcatctcata tcttccatca gggatcattt ttcttctaca 240 

taaaataaat catttgtaat ttc 263 

<210> 389 
<211> 292 
<212> DNA 
<213> Homo sapiens 



<400> 389 

gtaatgcttg tggtgttcca gacagcagaa tgtgagtggg acatcatatg taccacctct 60 

gggcctggac catagaactc acacataatc cttcatgttc ttatgtgacc acacagatga 120 

acaaagcaag ccaagtgtgg aaacgtgtta aagatgacgg aaccacaaga tggaacaagc 180 

ctggatccct gaatccctcc ttggaggatt agtgcccaca aattgtaaac agccacccag 240 

atctcagcga gcaagaaata aattatacct gaatgtttta aaaaaaaaag gc 292 

<210> 390 
<211> 244 
<212> DNA 
<213> Homo sapiens 



<400> 390 

gattgtctcc aatttacctg gaccacagcc agcaccgtat cctcaggcac cccatgggac 60 

agtacataca gaagaacagc atcacaccac atcctatcac caaggccagg attctgtgcc 120 

tccgcccccc tccccacctc cttgaaacgg gggaagtagg gggaagagtc aattcttctt 180 

ggagcacatg agatggtagc ttgctgtgtt gtcctgaaag aaaacaaagt ttgtaaatca 240 



<210> 391 
<211> 436 
<212> DNA 
<213> Homo sapiens 



<400> 391 

ctgaggaata tatgattggt ttcttggaac aatttcacag ctggcatgga actgaaaccc 60 

tgctactcag gggaaattag gatcagctct tgtccagttc aagctgactc cactgagcct 120 

ccaatggcct gtatgaatgc ccaatgagtg cccttttgac atcagaaggc caaaaactcc 180 

accctcagat tgtgccaacg acaccatctt gcgaacgtgg atcctatgaa aagccatgaa 240 

gcttaactgc actcgcacag atcagcaatt acctcacttt tccttaccac caattaactt 3 00 

tttccatgca ttggctgcct tgcttcttta ttccacaaaa atccttatgg ccccactttc 3 60 

aaggagggag aaatttgagg ggngttatcc cacctcctca cttggctgcc tcatgaataa 42 0 

aatcttttct ccctgc 436 

<210> 392 
<211> 178 
<212> DNA 
<213> Homo sapiens 



<400> 392 
aggctgttgt gatatcctgc atggacaagg 
gaacactttg atgcattggc tatgattgtc 
ctgtcccagt atgagaacat ggaaactcat 

<210> 393 
<211> 263 
<212> DNA 



aaatgatgtt catctaatac acccacttgg 60 
tttctgtttt ccctaccctc atctctagcc 120 
tttggaaaat gtgaaatgag tgatcccc 178 
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<213> Homo sapiens 



<400> 393 

attgatcgca gagttgaaca acagagaatg tgtgcacagt 
agattcacct gggagaggaa gtgggcaggg gcagaggagt 
gctgcatggc aggtgaccat tatcaccagt gtcgccagcc 
gtggttgcaa cctccatgat tccctagagc tgtttttacc 
gcacattaaa ttcatgctat tag 

<210> 394 
<211> 267 
<212> DNA 
<213> Homo sapiens 

<400> 394 

ggcccctaac agtgtcatag gcctgatgga gcagcggaac 

agttcctcag aaaccagacg gccttacagc ctcttcactg 

gaaatgcaga tgagtgcttt ctgctacaaa tctcatctct 

aacatgccat cactgtaact gctaaaaaca caacgtataa 
tgaatctgtg aatacagaat agcctac 

<210> 395 
<211> 180 
<212> DNA 
<213> Homo sapiens 

<400> 395 

gcacacatag ttccttttgg cgtcttatct tctgaagctg cctcaaggcc aagcaaagaa 60 

agttgttaaa aagttaagtt acttttcaca gcctgcaaac ccttcaaagg caagaactca 120 

aatagaaact tggaaaggca gataagccag aaaagtgtac taataaacgc acttaatatg 180 

<210> 396 
<211> 428 
<212> DNA 
<213> Homo sapiens 

<400> 396 

atgacactgt gagaagtcag atgtatcatc tcttttgatt accactggtt ctccaggacc 60 

tatgtcataa aagattagat caacctgtaa ccagagccta ttaagtgatc tccagcaact 120 

gtctccgagt tggaagtgct agccaaagaa tttcagtgat tgcgttttgt gtacttacac 180 

ctgtgggacc agcactctcc atttaatgag ccagctgctt ttctgattgc ttccccggat 240 

ggccaagtca ctgcagaagt ttcttgaaag ctcaaatgtt gccttttcct aaactaccca 300 

tggccccacc ccacctcatc ctgtgcctat aaagacccca gactcaatca gcagagagga 360 

gaagcagctg aatgttggag agaagggact tgacttcaga gggacagctt gatggagtaa 420 

ccggagaa 428 

<210> 397 
<211> 285 
<212> DNA 
<213> Homo sapiens 

<400> 397 

aaactctnat ctcttnccac tgnctntgtt attcaagagt ttgtttctat ggnggagcta 60 

atgagtctca tccttgcagc taatcaaatg tacnanagca tcaacagaat taagatggtt 120 

ancgaggtga ggccttgaaa tcaacatctc cgcctccttg cataaaccct tcattgagac 180 

tcctcttcca tttgggcaac ttgatgtggt tcaagagcat ggagaattga tctcttaaga 240 

ctcataaaat atttgcttct tcaaaaagaa taaaggaact gaaac 285 

<210> 398 
<211> 169 
<212> DNA 

<213> Homo sapiens 



gccaggcaca ggtgtggtca 60 

gtgcccacct gagctgaaag 120 

aggtcacctc tctgaatttt 180 

cagaactaat gaaaaattct 240 



ctgcctgagg gtaaagctga 60 

ctctttgaga tggaagagaa 120 

ccaagctgaa gttgccaagg 180 

tgaaatgcat cttctacaaa 240 
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<400> 398 

gttggagatt acatgtctaa atcttgttca cacctatggg attggacaaa attttctcat 
gaaactaaga gaacaggcca cagagtgtct tgcaatctat gctgctagca agtgtctttc 
tcatgcctga tgttatacaa aaactagcaa taaaggctta ttctttcct 

<210> 399 
<211> 224 
<212> DNA 

<213> Homo sapiens 
<400> 399 

gaggaaaggc tggaccctgt atttgtgttg tgtaccctca ctctaggagg tgtcttcaca 
ctaagagatg gccactcagc ttctggcatt atcactctgc atctactttg ccaagcttct 
tcttttgaaa cgtcttgtgt aggcagtagt taagaatatg ccacccagaa gaataccaga 
tgaataaact tacaaatatt ttgaataaag ctcaatctaa caat 

<210> 400 
<211> 466 
<212> DNA 
<213> Homo sapiens 

<400> 400 

gagctgatac tctattaatg gatctagtgc ctaaatcaaa agaacagaga gagtctgtat 
aagcaaaatt acctgaanaa aggtncgaaa aactggtccc aggnccntaa aatgctgngc 
tnnnaaaang nnatntnggn nnaaaaaacc ngnnancccc ttcntccccc ntccagaaac 
ctanaattna cgttctacna cttccacaac ccaattccaa cttcctttnt taatatgtgt 
aangngtatc tgccccatgg gccttctgga tgtgttcatc aattctgaaa aactctgaac 
tcggaagctc agtgagcccc agggtttggg gtaagatatt acggacctgc ncttnagcca 
aaagtgcttn cgctcactct actactgnnc tactgnncct gacggnngat gtcccncaaa 
gccnccttgc tgtggggcag gggggccccc tgtccttttt ggggaa 

<210> 401 
<211> 350 
<212> DNA 

<213> Homo sapiens 
<400> 401 

gtggggtctt tcaagctcag gaacaaagcc ttagtcccta caggagaaag gcaatcctaa 
ggagagcggc gcctgaaccc tttcctacca tcaagaactc aagaactcag cctaataaat 
gtgggcagaa ttcacataca ccagctccag gcctggccca taacacttcc tgcatgatct 
gggatgcaaa cgatccagtg gaggcctccg aggccctaag gatgaagcag ctggagacag 
aagggcctgg gtccctgaat ggctgggagg aatagagccc cagtgcagtc tacttgcacc 
cccaccttga ctctgacata ggcagaaata aatttttaca ctctaaaatc 

<210> 402 
<211> 133 
<212> DNA 

<213> Homo sapiens 
<400> 402 

agatgtatca aatgggagac ggccagcagt gatcaagtct tgattaatac tgaaaaacag 
aagcttgtgc tcacaatccc tgccattaca attctttata gtatgtaagt actttaataa 
acattatgaa gcg 

<210> 403 
<211> 330 
<212> DNA 
<213> Homo sapiens 

<400> 403 

gaaggaggat atccctgcga tcaccaagcc tctaccctta tcttccaaac cagtcactta 
ccacagatgt cttgtcaagc tgaatatcct ccagatctga cttctttcct ctactggtgc 



87 



tcaatacaag atgctttact ttgtcacaag aagcatataa taaactcaaa gctgcaagga 
tatatctgta agggaaattt tttcttgatc tggctggcct tgaacataat caccagaaag 
actttttgtg ctcagatatt atggttgtaa atgaggattt ttttcctcac ataagaatgt juu 
atctagtcca ttataaaatg ttattgatgc 330 



<210> 404 
<211> 242 
<212> DNA 

<213> Homo sapiens 



<210> 408 
<211> 419 
<212> DNA 

<213> Homo sapiens 



180 
240 
300 



<400> 404 

tcctgtgcct ataaagaccc cagactcagc tggcagaaga gagaagcagc ttgactggag ou 
aaagatgatt cgacttcagt gggacagcta gactttggag gacagacggc ttaacttcag 120 
ggaagagcca gctagtgaca accggacttc agggaagatt acctgcccaa cctgacccct 180 
ctccagctcc cctctctgct gagagcaact tctatcacta agtaaaattt tctacctcca 



240 
242 



<210> 405 
<211> 289 
<212> DNA 
<213> Homo sapiens 

<400> 405 

atgggaaact gaggtccgtg aagtcacttg cctggatcac acagctcatg accagtatgg 60 

gtcggcctgg gacacaggca ttctggggct caccaccagg tgttccacgt gtcaccacta 120 

gacctcccaa ccagggagcc ctgccgctgc cccagcctgg agacgtgaca cttctcccag 180 

ccaggaggct ccagtgaaac cagggattcc ccaggctcac cctgactcct catcttgtta 240 

acgtatttaa tcctcatcct gtacatgaaa taaatatttc atctcatct 289 

<210> 406 
<211> 436 
<212> DNA 
<213> Homo sapiens 

<400> 406 

caaaaggaaa gtcacagcca gagaacgtga ctcccggtga gcctggagcc agcgtgactg 60 

cagagggcca gtccccaggt gatgccggta cgctggagaa ggcctgggaa gatgtgcgga 120 

gacagacacc tgggacacct aaggaccaag cccagagcca cgctgctgct ttcccagctg 180 

ccactgggct gcatgaaggc agaacatctc cagtgagttc aacattcagc tccaacctta 240 

agcctccacc atggccaaga aaggcattgc tgctggggga gaaatggaca ttaacactgc 3 00 

ttcaaaaggg tgctgaaaaa cacccttcat ccccgatggc ttagcttgtg gaattcacgg 360 

gtacttgcat ctgaccctca tgagtctatg tagaaaaacc tggttgagga actgtttatt 420 

gacacccaca tcagct ^ 

<210> 407 
<211> 179 
<212> DNA 
<213> Homo sapiens 

<400> 407 

atatgtttgt ttattcgaac aggatgcagt ccagtcttgc tgacttagga tgcagcaacg 60 
aggcactatc atggaagtcg aaactgggtc ttcaccacat accaaacctg ctggtgcctt 120 
ccttgatctt ggacttctca gcctccanac cngtaaggaa ataaattctt tttttaaat 



179 



<400> 408 

agcttgtttg aagtgagtgt ggtctttgct cacccagaaa cagttgagga ttgccacttc 60 
ctagctgcga tatgcccaga ttgttttaag ccagccaaaa acaaacagtc tgtattcact 120 
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agaatggcag ttatgaaagc cttgaataag ataaaggaag aggatttcct taagcagttt 180 

ccttgtcctc caaactcacc aaaggctgta tgcgctgttc ttgaaattga atgtgctcat 240 

ggtgctgttt ttgtagctgg gagatataat aaatactcca ggaatctacc acaaactcct 300 

tggataattg atggagaaag gaagctggaa tcttcagtgg aagaattaat ttcagatcat 360 

ctgttggcag tatttaaagc agagagtttt aatttttcat cctctggaaa aaaaaaaag 419 

<210> 409 
<211> 409 
<212> DNA 
<213> Homo sapiens 

<400> 409 

gaacccagtg gctctgagct cagcacgcga tgcacccagg aatgtggcct tacgttgtta 60 

ctgtgcccaa cctgcgaaaa ctgggaagaa atgaagaagt catcctcttc ctgagacaga 120 

gcccagcagc cttggggcgg ctgagagaag atgggatcca cgtggcccat agcgcacccc 180 

acaggccttt tctgggaaag cagtcttctc tcggggaagg gagagacacc tgccgaggac 240 

ctgccagggg ctctcgcact gacgctgctg tccttaatgc ctcaacagta caggcaacat 3 00 

gggctacgct gagcccctgc tctcctggaa gtctggtatt ttggtatttt ggcaggtgcc 360 

aggcagaggg tgcctaagac cagccccata aagtccctgg gccttcccc 409 

<210> 410 
<211> 443 
<212> DNA 
<213> Homo sapiens 

<400> 410 

gccagcatgc acggcgcaca ccgtanctgn cgtctggagc tccagggttg ggggaattgt 60 

gttacgcatt gcctgtcact aggtatgagg ctgcctccga tttccacact nagaatcang 120 

gctgcagngc cctttgtgcc catggctgnt gatgcacaca ggattcttnc aaaacaagag 180 

gccctactct gtgactgtna gccttgccat caacactnct ntttggagna nagctncctg 240 

ntggccctga ggcaggagnn ttctgagatc ttnacntatg ctgggcttga tccangcctc 300 

antacaggtg aagaaacgga ncctgtaaaa ntgaagtggc ctgcttaagg gccngggctg 360 

aaagtctgag gcctggtttn aanccaaacc cnggcaaggc ttttgagaac tccaccnttg 420 

ctgccatctt acgtccaggg agg 443 

<210> 411 

<211> 96 

<212> DNA 

<213> Homo sapiens 

<400> 411 

agattggaga taacttcaat tggattatgc ccctggttcc ttatcctgac acttcctgga 60 
tgatcccatt acaaatacat gtgatgacat ctgttg 96 

<210> 412 
<211> 306 
<212> DNA 
<213> Homo sapiens 

<400> 412 

acaggaaata tgctgacacg ataataagat gtgagggagg cacatcttaa acttttgtgt 60 

gaagacccaa tcatcatgct gacgaatcac aaaaagatca gtaaagccca cccactctca 12 0 

caggtggtgt cactgtggct ccatcacatc agctagacct ggccatgcag tcccaacttg 180 

ttacctacag ttccagctgc caactcaggc catctcactg aatgaaatac ttgcttcaac 240 

attgaagatg tttcctctgg ccactcagag gaaacaccct ataatgaaca ataaacaaaa 3 00 

ggactc 3 06 

<210> 413 
<211> 219 
<212> DNA 
<213> Homo sapiens 
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<400> 413 



cttgcccccc acttcctctc tccctctttc 
gagccaggtt ccacaatgct tgatgantac 
gaagaaaaga aacctggatc tgcaagtgcc 
ggccactnct tcttggacca tccttaataa 



ctatgggctg gaatattgtg gatttggant 
aatnttttca ngaanacagc anaacagcat 
taagcagtga gcaagacccc accaacactn 
agttatttc 



60 
120 
180 
219 



<210> 414 
<211> 457 
<212> DNA 
<213> Homo sapiens 

<400> 414 

atccatggtc cttctcaaga cattggcttt gttctgaagc agctcccacg ctcttccaga 60 

aatctctatg cgggactctg aatgtggtca agaagaagat gtactggatg cacattccct 120 

atcaggagtc tcttaatagt ctcccaccca gttacaacat attgctgtaa tcccacacaa 180 

cagctgaaac atcttttctt catttctttt aattcctgta gcatttgatg tctccaccgt 240 

gtaatttaca tttaattgta agttgttttg catcatttaa tagttgtttc aagtatgaat 300 

gtcttgcctt cccaagaaag attaaaataa gaatccttta aaaacaagag cttactggng 360 

ccagggccng acttagactt agagtaaacc ncaactactg gcttcacttc aagctgacct 420 

aaccatcttc ccagcgaaga cggncaacct ggaacta 457 

<210> 415 
<211> 356 
<212> DNA 
<213> Homo sapiens 

<400> 415 

gcccggaaaa atggaggtta acttcattgt catctgtcat ggaactgtgg ccacaaaaag 60 

aggccgtctc tcaggccagg gtggctccac ggtcccagcg acatgcaggg gctcctcttc 120 

tccactcttc tgcttgctgg cctggcacag gtaatggcac cgaagcctcc tttcgctatg 180 

tttgaacagc gccacgcttt cctatatatt tttatagcag agcctaaggc acagcctggc 240 

acaagtgcgg gaaacaagtg tctctncatg ccagctccaa gcggaggctc aacttttcat 300 

tgntggttgn caaaaggggc aaanagcccc tgggaaaaac caaattttga caggga 356 

<210> 416 

<211> 99 

<212> DMA 

<213> Homo sapiens 

<400> 416 

gttctgtttg ggctctctgc ttcctcctaa agaagctacc aaactgccac ggttacactg 60 
ttttaatcgc cgctcattaa aagaaacact gactgggtc 99 

<210> 417 
<211> 173 
<212> DNA 
<213> Homo sapiens 

<400> 417 

ggccagacct ctgcagaagt ggtgtcaatc acttactcct ttccataagc tcactgcaca 60 

caccacttat gacacagaag actctaccaa aggaaatcaa actacagaac agcaacaaaa 120 

ctcaaaannn gnncatttgg cttttgtgtt attaaaatat tttctcagca gac 173 

<210> 418 
<211> 463 
<212> DNA 
<213> Homo sapiens 

<400> 418 

caacaaaaaa tggattaaga cgccaagagt ggagagtccc tgcacaaact ggattcaaca 60 

aggacagaaa ggaagcccaa acgctttaca tattgcctgc tttacacccc aggctcaagt 120 

ccagaaagtc cctgtgatac aactctccag tgatttcccc tgaggtccaa cctagtaggt 180 
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gcttaaaaag tctttgttgt aaattaataa attaatccaa aaccaccaca ctgctatttc 240 

ctcctaccta tcttcctgtg cctatcataa gctgtatcac ctggggaaaa aacatttttc 3 00 

agctaaattt agaacaggga gggttttggt ccataattcc acttctagta atagattcta 3 60 

aggaaataat cagatttaga taaagatagg ngtatgataa tattcaggca atggggtttt 420 

caatagtgga aaggtgggat caacctaatt tgaaaaatag cca 4 63 

<210> 419 
<211> 474 
<212> DNA 

<213> Homo sapiens 
<400> 419 

ctctttactg gtgagaagat agcaaaagct gaagcagaca cagaatccac aagtggaaaa 60 
tacagcagtg ccattaaagg agtgggcatg tggcctattt ctggccctat gaagcaaaag 120 
gagaggtctg ctgggagact tcctgaaact gctcttcctg gaaggaggga aacaaacaaa 180 
acaacaacaa aagaacttta caagagaaag ctttttatcc cagccccttc ctactcccat 240 
tgaatgcagc tctgtgagga cacgatattt gaagctgcag tagctgaggt ggcaaaagat 300 
ggcagaacag aagagcagac agaatctggg tcctagatga cttcattgca ctgntgcaac 3 60 

tgncttntnc aganctnttg gcnnggggna aaaaatnaaa nggcntcntt gnttaanccc 420 
ctggganact anattntgtt ctttgccact gaatgcatcc taatgctgga actg 474 

<210> 420 
<211> 477 
<212> DNA 

<213> Homo sapiens 
<400> 420 

accttngcnn gaaacatgaa tgctnacacg cagtggtgca ccacangcta ttgcactnag 60 
ngagagcccg atttgttngc tttgngcccc tggantggaa tcccagnggg aagatngnna 120 
tgagagtcna ggnctacgga tgttnctata aatcagacgt tgctgncttt gatggccnna 180 
nctnacttct gnacaggntc aatnaaaagn tgatnantac tntcaaanat gtgatctncc 240 
tgaagttcaa natcatgcna ggagatgggg tcctgttcca tggagaaggn ggggggggag 3 00 

accacatcac cttggaactc cagaaaggga aggctcgncc tacacctcaa tttggnggnt 360 
tgtagttctc cttgaagagg tccttcacat cccttgtaag ttggaaaaac attccatgct 420 
catgggtagg aagaatcaat atccgtgaaa atggccatac tgcccaaggt aatcttg 477 

<210> 421 
<211> 292 
<212> DNA 

<213> Homo sapiens 

<400> 421 
gtttatttgc aagatgggtt tgagggaatc 
cctctggatt aaaagggatg tttggatgaa 
ctaaagaaaa agattattct acagaaacaa 
cctctgtgca cttgagaaga agacaagact 
aaacggatgc acttgggcaa aatcatttga 

<210> 422 
<211> 98 
<212> DNA 

<213> Homo sapiens 
<400> 422 

agagctgact ttanagggat caagaatatc tagntggatg gaaggagggt aaactcaaag 60 
gacatgtcat gaattcctga accacaataa atctgtga 98 

<210> 423 
<211> 103 
<212> DNA 

<213> Homo sapiens 



aaggataaag tctgctgaaa 
gcttcaatct caagaagagg 
cacatcactg gatgcctctc 
ctcctatttt tagatgggaa 
taaaaatgga agctgaacct 



gtagtaccag 60 

caagagaaaa 120 

accatgcaat 180 

agctgaggca 240 

cc 292 
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<400> 423 
aaattccnng gactaancnt gancacaact 
nttctgccta agaataagaa gaacttaata 



ccatcggctt 
caaatggaaa 



tgaagattct gtgccttcta 
att 



60 
103 



<210> 424 
<211> 376 
<212> DNA 

<213> Homo sapiens 
<400> 424 

gctacctctg ctcactctgc cctgataaca ctgaatacag gaactgtctc catcacccag 60 
aactcccgga accaagcact cagcccgaca cgtcatactt attaaaaaca cggaggtcgt 120 
gagtggattt ccacgtattg ttctagatga tggagaggcc tgaagagtga ggagtgggga 180 
agaaatgtca tcgctgtttt cacctgcacc cttgtttcag agaagtgaat agtcattcat 240 
ctctggtcaa caaaatgata atagtagcag caacaataat attctctttt tttgagcact 300 
tcttatgtgc caagtacttt atgtatgcat tatcataaat aaagcttttc accattncct 360 
taattctttt attttt 376 

<210> 425 

<211> 78 

<212> DNA 

<213> Homo sapiens 

<400> 425 

agaaaagcaa tgtcttgcag tttggtggga gagagtatgc agtcaccaac atggcatgaa 60 
tttaggagtg aataaacg 78 

<210> 426 
<211> 330 
<212> DNA 
<213> Homo sapiens 

<400> 426 

tgtgagggtg aggacctntc ctggctttca ccttcaaccc tcacctcacg aaggaggaag 60 

gtgcagatac tccataggtg cttaggagtg tnagtgttna gngactgctg caagaaaaga 120 

ggagatacga tctgatcact tagacttcaa atccaaacct tgaaaagtcc cacccagtgg 180 

aggactcttg ccgccttgag agaacacagc tgatgtccgg aagcaatatt gntaacntta 240 

ccaataantc caatcaaacc ccaaaaaaaa aaggcccggn ggcccattta ncttggantt 300 

accaggctga acttgnttaa aaggggggga 330 

<210> 427 
<211> 291 
<212> DNA 

<213> Homo sapiens 
<400> 427 

tgatcctaga ccatccccct tcgcccttgt tctcaactgg ctgggaagat tcaagagagg 60 
cttccaacct gctggcagtg acggatggca gtgcagaggc acacaatggc aagtgcaggc 120 
gcgtcaccag ccttgcagct ggccttccaa agaaagaacc aaagtcgaag tctgtcctga 180 
cagaggctga tttaattaag gttatagcaa agggcagaac tgcctgtggg ctgcattctc 240 
tgcagagggc caaagacaat gcattaaaat acttctcagg aagaaaaaac c 291 

<210> 428 
<211> 304 
<212> DNA 

<213> Homo sapiens 
<400> 428 

atttctcatg gaaaaggacg gcctggagcc tttgaacagg gtctgtgtct tcctcctgtg 60 
tcagcaatgg gggaggaaaa cgagcgcact acggggtaaa ggaggtcacc caagatctca 120 
agttcacgag tggcagcctg gattcaagtc cctgcctgcc tccagaacct gagctctgaa 180 
acgctggact aatcagaacc tcttggccct gaaaaatgag gcctattgaa cagagacatt 240 
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tgtaagaaaa gggactatta caacctattg taaagtaaca agcaaataaa aaatgaaatg 
gccc 



300 
304 



<210> 429 
<211> 248 
<212> DNA 
<213> Homo sapiens 



<400> 429 

gcgattactt taaaacatga aagaaattgc accttttcct taagggcaag atggtgctgt 60 

gggctttcct ctctcctgat gagatgatgc aaatggactc catagagaaa cgctgcccgt 120 

gtaacaatgc agttacgcaa cccggtgcat gacacatgaa ttgcagcgca cctgagatcc 180 

tgatgaaatc ctgggagcct ggagctgtca aacatggttt taaaaaataa agggaataca 240 

cccagccc 248 



<210> 430 
<211> 460 
<212> DNA 
<213> Homo 



sapiens 



<400> 430 

ctgctccgtc ctgtccggag gcttcctgaa ggcctgtgtt ctcacctgcc cttagtggga 60 

aaccttctat tcatctgatc tattttcttg tgggggtggt caaggggccc attatgtctc 120 

catctccctt tccaagctcc aaagatnatc tgttatgggg gcttgccatt tgtaatctcc 180 

accaaaaaat tcattattgt tggaaaagct tggattattg gattttgggg gaccttgtgg 240 

ttccttgccc aaaaatccca ttgtccaaaa ccgcccattg gtgggatggg tggtggcttt 3 00 

tcccttttgg cctttcttgc catggatttg gaaaaagttt tcccttggag ggccctcccc 3 60 

aagaaaagcc caaagaaaag aatggcccgg tccattgcct ttccttggta acaagtcctt 420 

tcaaaaaaaa cgaaatggtg gtccaaattt aaaaatcttc 460 



<210> 431 
<211> 176 
<212> DNA 
<213> Homo sapiens 



<400> 431 

tctcagcgga tgatcttatc tcctgctaca tctagaaaat ggaagccatc agactccatc 60 

ttctcaccac tgaggctaca aaagatatct acacctgcaa ccctttccct ttttttcttc 120 

ttcccttttg ttatgatgta taaagtgtcc cttatctgat aaagagctaa tcattc 176 



<210> 432 
<211> 301 
<212> DNA 
<213> Homo sapiens 

<400> 432 

gtgcctcggt atgggaaact tcctaagatg ttgttttggc tgtaaatcat gcggccctct 60 

cagagcaatg catttgtgtg atttgcccaa ttgtgcatga gtacagtcag catggaaatc 120 

cagttcaaac tgcagaagat cagcacctgt gagctgaaat gtgcatgtgt attttacagg 180 

gtggaggata gtgaagacag attcaagcga taatacatca ggtttaaatc ttctataaat 240 

gagattggat tactgcagct gataaacatg gaaatgagta attaaaacat ggtgtgtaag 300 

g 301 

<210> 433 
<211> 443 
<212> DNA 
<213> Homo sapiens 



<400> 433 

ctctttcaga tcttcaagaa tgtttaagca tacaaagaag ccccgagacc acaagggtga 60 

gaactaccat cctccccgct ctccggatgc tcccacagcc tgggctcccc agtgcagagc 120 

cagcaccaag caggagatgc agtacagtgt gcccaggacc atggcagcca tcacatatgc 180 
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cctccactgg ggaacaagaa gtgcgttagg ctgatgtact ccactccacc tccatacgtg 
tttgtgcagt gacaccagcc tggagggcct tctatcgcca tctccctcct ctgtaaat'cc 
tacccactct ttgagtcttg gncccagggg ctgntgctct ctntntctca aatgatttct 
gtgttctcat ttgtctctgc cttctctggg aatctttggt gccacagggt aatctcctgt 
gtgtcactcc tgacttcgga age 



<210> 438 
<211> 229 
<212> DNA 
<213> Homo sapiens 

<400> 438 

gccctggcaa enactattge ettttctget tctttgagtt tgactatcat ggatacttct 



240 
300 
360 
420 
443 



<210> 434 
<211> 288 
<212> DNA 
<213> Homo sapiens 

<400> 434 

ccgtgcttcc caccaagggc tcttggatgg aggtgtcaag gtgtgaagac acagcccacc 60 

tagagaggag agactgetga cctgctaact gaaaatataa gcaagccctg acatgccaca 120 

ggccgtcgga agagacattt gcttttgagt acccagccta ttctactctc tgacttatgt 180 

agatgggaca aatggtgccc tgggcacact catctacaca teagectgaa ttagctagta 240 

aatcacaact gcagtagcta ataacageca taaagccttt tgaatgtt 288 

<210> 435 
<211> 383 
<212> DNA 

<213> Homo sapiens 
<400> 435 

ataacagcac tatgggaagg aggaagaatt taatgaaagc ttgtacctgc tggctgaaac 60 
taagcagect atttataaac tgctctgaaa tgecagggag caggtaactc ccaaatgaaa 12 0 

aagcaagcag gtctctccca ccatcagtgg gatggctgag ctgtctgtgg tgectttgea 180 
tettgetget tcgctgaccc tgaaggtctg ccccagcctc aggegaccaa gcctacagcg 240 
acctcaagga gcagctgcct catcagtget tgtaggaggc tcaggatgga gaggggtctg 300 
atgcccccat tttgttccct tcttttgtct tcttttgact tccctaggga agggaaaatg 360 
tgctatgaag ttaaaagagg aat 383 

<210> 436 
<211> 251 
<212> DNA 
<213> Homo sapiens 

<400> 436 

atagaaaaga agataaacac tcaccgcaga gagttggctc catgtggatc tcaatggctt 60 

atggtgaatc acaatttttt catctgactt ctgttctttg ggctctgact cttcatcaga 120 

atcaatgtca agggecttet ccttgtagtt ttgatacagg acagcatttt ctgcaagaaa 180 

acaaggecta tgtgtcacta attgttctca atcattatgt tacttgttct aaataaacat 240 

catatgtacc c 251 

<210> 437 
<211> 220 
<212> DNA 
<213> Homo sapiens 

<400> 437 

gtggcttgaa atttgaaaca ccatatgaag gttggggagt ctcagggaca gcccagctgg 60 
ggatctgaag ttgctggaga agattttgee taggctggee agcaactggc agacaagagt 120 
catcctttca caatgctgga gacagtagac cttcttcagg accacaagca agtcaccatc 
tctgggtcac agcttcctca attaaaaagt tagaagatag 



180 
220 
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acaaatattg attttcaaga tcaggaaaaa taccgggacc agaagacaaa tttcagagcc 120 
acctaaattg tggagtctaa taaaagattc ctttctccta atgatgtgac catccaaagg 180 
atacactctc agtgtaaacg taaacccaga ataaaatttt atcatcacc 229 

<210> 439 
<211> 309 
<212> DNA 
<213> Homo sapiens 

<400> 439 

cagttttctg cacctgcctt ggtatttgac aactccagcc 
accaatgctt cttcagcttg aagactaaca tctagaagag 
aggagtctta tcttctagaa agtttttcaa acatcccaac 
ggtgttcttc tacagcccca cacatgcaaa catctttatt 
tcttcgtata tgtgnttttc tataagtaca tttatatgaa 
cacttcctc 

<210> 440 
<211> 756 
<212> DNA 
<213> Homo sapiens 

<400> 440 

ntcaacaaac ttnaacttnc cgggnttgaa aggacaaaac 
tggggggaaa ncaaacgggt ttnaaataaa ctnttnatat 
aaatcngggc catttnacna aaaaatgaan tnggcnccca 
ggggtggnaa aaggctnttc cggttttgac tgggggcaca 
gctcttaatg cccgcccgtg gtttccggct tgtcaagcgc 
ttttttgtca aaganccgac cttgtcccgg tgcccttgaa 
gcaagcgccg ggctatcgtg ggcttggcca cgacagggcc 
ctcgacgttt gccacttgaa ancgggaaag ggactggctt 
ggggcaanga tctcctgtca tctcaccttt gctcctggcc 
cttgatgcca atggcggcgg ggtgnatacc ctttgatncc 
cacccaaccg aaacanttgc attcgaaccg aacacgtacc 
gtccaattca agaagatnct ggacnaaaaa caatnaaggg 
tgttcgccaa ggcttnaaag gggcgcattg ccccca 

<210> 441 
<211> 599 
<212> DNA 
<213> Homo sapiens 

<400> 441 

ccctgtgtga ctcatggaaa acagggagtg acgggtcaag cagagaggaa tgtgaactta 60 

gtgggtaatg ccataaacct ttggccagga cataagcagt agaagcagcc tgcatgtgtc 120 

atccatgaga aggccccgct gtgactgcag aggcaggaaa ccaggtgtca gtggagacaa 180 

aggagtcctc ggcgcgtgaa atgggacttg gagcagggcc cgacgggagg ggacagagga 240 

tggctgccag ccagacagtc ctaactcggg gaattcagtg accacagcat ccccgggtga 300 

cacggctgtg aggccttcag agcatcacca ttcagtcacc cctttttaca ctggggaaac 360 

tgaggctcaa ggaagtaaag cagaaatgcc tttagcctgg gcaagaaggg acctgtccta 420 

nccctgcatt ttgggagcag tgcttcttca actacctaan gcaaangacc catttgggtt 480 

tcaacctctt atcttgttca nactgatagg ttaataagaa acaataaaaa tgatttgccg 540 

ggcaaggngg ntcacacctg taatnccacc ttttggagnt gacccggcag ataacctga 599 

<210> 442 
<211> 512 
<212> DNA 
<213> Homo sapiens 

<400> 442 

caagaacttg agacggggat cttccttttg tacccggccc catngnttaa nncnngnatt 60 
ccnaccnttt tggnagtccg aggcgggncg ggntcacgaa ggccaggagt tcaagaccag 120 



aattttccac ttgcttcctc 60 
tcatgaagtc taaagtcaag 120 
ctcaaaaagt ttggctaaat 180 
gcacttgtgt cattattttt 240 
ggnatatttt gaaataaaga 300 
309 



ttttttcggg gctttttcng 60 

anaccccccn cncctttggg 120 

agggttttcc gggcccgttt 180 

aacaaaaaca aatccggctt 240 

aaagggggcc ccccggtttc 300 

atgaaacttg caaggacgaa 360 

gttcctttgc gcaacttgtg 420 

gctattgggg cgaaatgccc 480 

gagaaaagna tncatcatgg 540 

ggttaccttg gccattcann 600 

tcggaatgaa acccggcntt 660 

cttcgcgccc acccccaact 720 
756 
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cctggcctat atggttgatc cttctagtct cgtggcagaa ctttgtagac accaagcgag 180 

aggggcagcg tgttctggac ctcattcctc acacagggct cacctccgga tgagtcagag 240 

gccttagccg gtggcccagc cccgggaatg ccaccccggt tctgtaccct gcccaggcca 300 

gctgacaggg tgtattgggg cacacacctg cagcatccag ggcactccaa ggagagggac 360 

gtacttttga ggagaagtct aaaagtctaa gtccaccacc tgaacttggt gggggaangg 420 

cttctatacc aagagggctc cccgcctgtt cttaaaagcc atttaagcag aatgacgtgg 480 

ctcttcaata aagtaaaaat gggtcatgct gg 512 

<210> 443 
<211> 223 
<212> DNA 
<213> Homo sapiens 

<400> 443 

gattgctccc tttgggagac accagccacc attccatgag ggcactcttg gagaggttca 60 

aatggaaaga atctgaggtt tccactaaaa gccaatacta tcttgccagc catgtgagtg 120 

agtcaccttg caaatggatc ctccagccca tcaggtctac aaataactga agcctcaagc 180 

tgacaacctg actgtaatct cataaagtca taattgacca act 223 

<210> 444 
<211> 618 
<212> DNA 
<213> Homo sapiens 

<400> 444 

gctggagtgc agtggcagga acacggcagc ctcgatctcc tgggttcaat cctcccacct 60 

ccgcctccca agtagctgga actacagatt ttaacaatca gactcaggtc aacagtggtt 120 

gagataatgg cccataattg gctccagaat gcaaacgtgg catttctcca ggattccatt 180 

agctcagaat gacaaggtga ctccctgccc ccacctccct cacaagatgg ctccccgggg 240 

cttcctctga gctctgtccc tgtcctgcac ctccctgtgg ggacggctga gctgctggtc 300 

ctattggagc agcatgaaca ccttgctggg tgttcatgag ggagaaaagc tcatgaagga 360 

atgaatcaga gttggatgct atgcatataa atatttaggc ctgtaagggc ttctctttgg 420 

tgatctgatt ccaccacata ccaggtacct cagcataatt caaacattcc tgcaggaaag 4 80 

ggtcataatc tctgctctat taaagtccaa tttatccttt aaatgaaatc tactcacagt 540 

cctgcagatg aagactactt nctgccgatg accacagcgg ctaagangct gaggcaggag 600 

accgcttgac ccagaagg 618 

<210> 445 
<211> 459 
<212> DNA 
<213> Homo sapiens 

<400> 445 

agtggggctc cgtttggctg cctgtttact aaacgtttca gaagccggaa gaaaatacat 60 

tgttgagaac atagcaaaag cagctcttct tgacaaaaat ggaaagaaac atcctcaagt 120 

ttcagtgctc aatatatttt ccgatcaaga ctacaagaga tcagtcatta caatagcaac 180 

ttctgttgat aagttggtgg acaagcgcaa ccaagcctaa aggcaagtgc tgttgcgagg 240 

tcgacatcca ggaaccagag gagggcagag caatccacag aatggatctg gggtgactca 3 00 

tggaggaaaa ccaacacaca gtaccattta attcttttta aaaagatgga aaattatacc 3 60 

atacccngaa ttactaaatt cttaaaagag ggggtttntn gcattccatt tgnaaaanaa 420 

ngtttcccca tgttctttta aaaattcatt ttaaaccac 459 

<210> 446 
<211> 403 
<212> DNA 
<213> Homo sapiens 

<400> 446 

gccttcagac tcagattgga aactacagca atggccctct gtctctcagg cctttgaacc 60 

acaccactgg ttttcctggg tctccagctt gtagatgact aatcatgaga cttcacagcc 120 

tccataatcg gaatgaaaac aatggctagt cctggattgg tcatctttaa ctttgatgag 180 

atgctgaaaa tgaaagccag gactgaggga agattgaagg agtctgaacc tctgacaaca 240 
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tggagtacca taccaaccct ggactatcta cctccagact tttacatgag taagaaacac 3 00 

ctagtttgnt caaaacagta ttaatttgga tctttgntac ttgcagttaa acctaatcct 360 
gaaatacctg cattctcttg aagtaaattg ctttcaaaaa cct 403 

<210> 447 
<211> 635 
<212> DNA 
<213> Homo sapiens 

<400> 447 

tnccannctg aggcccaatt ctgtnggaat tgctttttta aaaaaanttn tangnntnan 60 

ttngaantnt gcctgtccan atttgngggc cagagattta gaccctcatc ctcaaggcct 120 

tattcctcac aaaagccata tgtaaaactg gctgctccac aagggctggg atcctgtgtg 180 

tctcattccc cactgtgtca tcaagtgccc agcacaaaac agagctcagc aaatgcttgt 240 

cgaataaatg aatgaaaacg tgctcagcac agggaggtaa aggcaccagg accccatgga 300 

gagagagtac atgctgagtt ggctacatct gtgccaaact gtgaaagatg acaatggaga 360 

tatttctctc tacagtttct gaagatggac ccagcccaac acttctttcc atgcctggct 420 

gtttttaact gcaggcacag cactagctgg tttgtctcaa agattatggg tcaaaagaga 480 

actgagagac aggcaagtat ccccncggct ggacatactt tacttgccgg caatacatag 540 

tgctcttctt gcctgacaat tcgaacaagc agcttgactc tgtatttgag gccccactcc 600 

cttttggcta actagaccan actaatttac tcatt 635 

<210> 448 
<211> 81 
<212> DNA 

<213> Homo sapiens 
<400> 448 

actgaggttg tgcaggaacc cccagacacc cgccccgggc atgctncaca cangnggcgt 60 
gccccctgca caaaaaaaga a 81 

<210> 449 
<211> 616 
<212> DNA 
<213> Homo sapiens 

<400> 449 

gttttgaatg gtgctgtttg gtcacaacat ccacttgctt tgaggtattg ttggccttgc 60 

tctgctnaca ttctgagaga tctgcactcc aggcaccttc tgtggacatc aagctcacgt 120 

tttaccgtcg ccactgaatt tggccaccct cccccctcta ctgtgcttct gcgctacaac 180 

tgtcccctcg tttattcaaa catggagttt tctttcctat ttatttttgt ttgctggcat 240 

ttttagagat gagactgcag aagaactttc ttactatgcc attttaaaca cagctatctc 300 

atgatttttg taaaatccag atataattgn tgnctttttt tattcttgcg taaagtgtga 360 

aatcttgcat accttcatgg nattttgtaa tcagccccac ctatttcatc ttcatcttct 420 

gctgcttnct cccacaactt ttgtttggct acaagatgat atcataccaa atcctcagtg 480 

gcaaaatgtg tttctnctga attcataaca taaaaaaanc cattaaaagg ggggtangca 540 

tacctgataa ctattactgg aataaaaacc cggactcacg ccttagaaan aaaaaaggtt 600 

atcaaagggc aacaaa 616 

<210> 450 
<211> 617 
<212> DNA 
<213> Homo sapiens 

<400> 450 

tgctgctgga gctgattccc ttcccctcct catctnccac ctnctttcag tntcacatac 60 

acacacagat gctgccacag acacacgcga gcgcaaatat ttacacactg ccacaccgaa 120 

gaaatccatg cacgttttcc tgcaaacgcg cgcgcgcaca cgtacttcgg cgggcgccca 180 

cgtcctctgt ctcaccaaca gacacagaca tttacacttc taggccagga aagcgctaac 240 

cagggccctg tgactctacg caggttccag aacacgcctt ctacatttgt tactgaaccg 300 

atcagcgaac acagacaaac gtgccaacac ttaaagtcta ctggctggac ttcatctnca 3 60 

tggcaacaaa gcatggaang naaagagttg atttcagaag gaactgngaa gaagcncaac 420 



97 



CAl- 201292.1 



aatgngccca gtgataatga gtagnaccta 
cgaaagatta nctttnttat tgctctngac 
ttgggnatct tcttgggact tntttttttc 
tgnngcatgg ctcaatt 



tgngggactc ttnancttaa angantggca 
aaaaaaantn gnttttnttt tggngggaat 
cgatggcttc aaatcctggt ngacccttnt 



480 
540 
600 
617 



<210> 451 
<211> 203 
<212> DNA 
<213> Homo sapiens 

<400> 451 

ttttcagatt cttccagcaa tgtactacaa atttctgggg aaaaggaacc atgtgcccct 60 

gccaagatgc ccagtgcagt accagcaaga tggccaacgc ctagagctcc cttgttgatc 120 

tgaaacctcc ccttttcctt acttctccct ctgttcagaa tgtgtagact tctctaagct 180 

ttgttaaacc tgtttacaac ttc 203 

<210> 452 
<211> 445 
<212> DNA 
<213> Homo sapiens 

<400> 452 

gtgttggaag gatgtcagat gagagctggg atggggagag gaagtaagga ggaaagataa 60 

gcagctccct tccattctga cctgctgtgg caagaatccc gggactagca agaccaacag 120 

gatgcagctg gcttcactga acataatttg ctattagcat cttcaggaac acacactgct 180 

ggataaattc ccttccagga gaggccacaa ctgaccacta catggaagag acagctgctt 240 

cttcactagc caatgaggca tccccaccca agtgtgacca aatgcctctg aggctcagcc 300 

cctcactcca gaatgccccc aggtacctga ggatgctcca gatttggggg ctgcaccgtc 3 60 

tgtggtttct ctacattaaa cagtattttt gtggagtcag gggtgaggga gtatgggtta 420 

cttttaaata taggtttgcc aactc 445 

<21Q> 453 
<211> 460 
<212> DNA 

<213> Homo sapiens 
<400> 453 

gggcctgaga atgtcactgg ccagaagaag ttgagtccct agtgtgttga cccaccagtg 60 
ctctcactga ccaactaagt gactgggtac aaattaaaga ggagaatttg aatgtctggc 120 
tgtctgggaa ataaaaggtc agagagttga ttagcaccat caagccccaa tacccagaat 180 
catggagaga aacagtggct cggacctcta agcggcacct ccaatgactt tcctgcacct 240 
tgggggattc cctcgcccca ttttttatcc cattgcccct tctgtgccag tctcttcctc 300 
tgcgaggaag tggtttgaga accctaaaaa cgaatccaag gaatcctttt tgtttggggc 360 
agttttctgc aggcaacatc tgtgtgcatc ttagttgtca caggtctggt caaatgtaga 420 
gatgaataaa ttttaaaaat aaacaactac aaaaatacac 460 

<210> 454 
<211> 261 
<212> DNA 
<213> Homo sapiens 

<400> 454 

gccctgccac catgccatga ggaaatggaa agaccacgtg gagtggtcac atataaatgt 60 

tccagccacc agcctcagca gaggtcccag cccacagtca gcaacaactc cagacacgtg 120 

agtggcagca agatgatgcc agccgcagtt accatctgat tacaacttca taagaaaccc 180 

tgagcaaggg ctgcccagct gagttcaagc aacgccccag acctgtgggt gatgataata 240 

aaattattgt tgttttgagt c 261 

<210> 455 
<211> 591 
<212> DNA 
<213> Homo sapiens 
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<400> 455 

gaaaagacag aagctgattg aggtcccagc ttggtaacag tttgaagagt tgcaggactg 60 

gctggatgag tactggctgc agcaaatcag gctgccagga ttctttatgg ctgtttctgc 12 0 

ttccactaca gctgagtcag aaaggtcgct gccccgtggt ggcactagac gcagtggacc 180 

tggcaagcaa atgtttccgc tattagctct cagcaacaga gactcattta tggtcacctt 240 

ggaaatctgg gcttatcgat ctacagccca agtctgctga gaagctggag cttactaaag 300 

gggaaacctg agagctgttc aagccccaaa tattttccac ttctgcgtca cctctgctgt 360 

ctgttagcag agtggaggag aaaatacaca gcacaaacaa cgtgaaaaaa tagttactct 420 

attcattaaa agctgtaact tccagattgg acttgagaag cattaaagca acagaggacc 480 

ctcatctact atctgtattc aagcatgctc atgaaaaaca cgctgctcaa ctggacttan 540 

aaggaacccg ngcatnacan gcatttcttg acagaatctc gtgggcctgg t 591 

<210> 456 
<211> 475 
<212> DNA 
<213> Homo sapiens 

<400> 456 

gctccttgtt taagccaaaa ctgntaaaga ggaatcaggc tcagagaagc tnaagaagcc 60 

ggcctgagtc ccagctagca aacagcaaag ccatgatttg gacagaagcc tgtgtgactc 120 

caaaacccac gctcttttca ctgtgatgca cggctaatac tgagctgagt gatgggaagg 180 

gagctctctt tgngggattt tcangatacc ttcaaagatc angntggntc tgtttgcaga 240 

cccaactttg caaaggacaa gcntgtgtct tnactcacac tanctcggcn caggttctga 300 

gcccttttgc aatnggaagt tatttaacct gatcacanca aaatgaaaga ttatttgaaa 360 

accngggatg tgaaattctt ggaacccaaa gaaaattatc ccatgnttct ccaagnacct 420 

ttgccacccc ttgtggncct gctaggncac atggacccca aacctttcca gaaga 475 

<210> 457 
<211> 145 
<212> DNA 
<213> Homo sapiens 

<400> 457 

gtgctggtca ccttacccaa cctgcggcct ctacacagag aggccttggg ggagaggaaa 60 

agcttctcca gtgattgatg tcagcagctc acccganagc caagaacatc anaggtggga 120 

tgatgatgct ngtggctatg agaca 145 

<210> 458 
<211> 434 
<212> DNA 
<213> Homo sapiens 

<400> 458 

cagaattggg acatattcca cttggggcta ggagccaact cctttccctg ctgctactgc 60 

tcactccctc tgtctcatcg aggagaatgc tccacccagg agcacagaat gaaaggcaca 120 

gagtatagtt tccagaatcc ccgcatttca gtgttcccaa agggctgaat tcttgtcaat 180 

agaatgtaag tg'gaaatggg ctatgtcact ttcctgctga agaggttaaa aagaaggtga 240 

actctcttca tctgcagttc ataagataga aggatcccgg gtccctgaat gaccteatgg 300 

aaggccatct aacaggaaca cccacattgg actgtgatat gggcaagaaa taaactttaa 3 60 

ttgcattggg tcagtgagaa gttttatctg ttacggcagt tacttctact ttaataaata 420 

caatgcatta tctt 434 

<210> 459 
<211> 493 
<212> DNA 
<213> Homo sapiens 

<400> 459 

tctggggagc tcctgcatta agtgagganc tgangaaaca ngcantanca accagaagac 60 

aggaggcaca agaagttagc aaagaaagcc acctacttct tccgccttaa tttctctaag 120 

cacttatcaa gcagaagaat cacagaagaa tacaataaat ggtctagaaa ctgcagtgat 180 

gatttactaa aggaagagcg tggttccccg agcaatggcc ccatcctcga gcccgaagac 240 
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ccactgccct aaatgaggac agacatttgt ttttgcactc aaaaaagttg ccttgtggct 300 

cgccatgccc cctaatct.tg cccccaaata aactcgagac cttagcgggc acgcactcaa 36 0 

gtggctgaac atggagacca gcagaacagt gccggcggaa tgacatggcc gagaaagaga 42 0 

gaaagangag ggacattttg gaccccaagg gaaatttggn ccggggtggg tngaaaaaaa 480 

atttggccct tga 493 

<210> 460 
<211> 404 
<212> DNA 
<213> Homo sapiens 



<400> 460 

aggcccagga gaaaatatga acaaaaattg gtgaaggcca tcaccagacc tagcagttgc 60 

atcctgttca gcaccacaga cagctccctc gcaaatgcca tcctttcaaa aggtaccata 120 

cagaagacag ctactgagat tctgcagatt ttctaaaagt gacatttcta ttacacattt 180 

cttcttttca gcactgtcat atgtaatggt atgcattatt gcgttgtgta cattttgtga 240 

tacatcatca atctgctaca ctaccccatt aatccattca ttcaataaaa tacattgtta 3 00 

tgtgccagat actcttagac aagtcactta ccccnttagc ttcatttcct tacccaaaac 3 60 

ttgnggatca ttatacatgg ttgataacta aggaaaggat tttg 404 

<210> 461 
<211> 583 
<212> DNA 
<213> Homo sapiens 



<400> 461 

gatctccacc atctgggggn acacggggaa ctggnacntt gggnggggcc tcaanaactc 60 

cttcaacnaa ccctttccac tggcccgaac ttnttgtgca ttnccacaag cttggcgacg 120 

gggtggatgc cttgcctttg gatgggaaga atccttgcaa gtcaagacta cattccttgg 180 

caccaaggtg gccaaagccc gtaccgaact tggcttggaa gcttaccttg ggcaccaaga 240 

aagaaatgga cctttcttct tattgaacaa tttcttcaaa cttgggccaa ngggttcact 3 00 

ttcaaacttt tcttaaaanc ttggnttncc aagcccacac caagtcaagg gggaagtctt 3 60 

ccttggtatt ggaaangnac ttggggtngg ttttgcttgg aaaccgggct tggaaattgg 420 

aangggcccg gggaaaccgc cacccccacn ttacccaacc ggtnggggng gaaaattggg 480 

gcattttacn aaccgnaaac aaagtccccc ttggcattgg aaattcccct tnttttttgg 540 

ggggaaanaa agtnccccgg aacnttgggc aagaaaccgg aac 583 

<210> 462 
<211> 339 
<212> DNA 
<213> Homo sapiens 



<400> 462 

agaaaagtca gcaaaaactg cacattatac agggcgacag gcatggcagc agtttctggt 60 

gcacatgttg cctgtctccc ggtgacagaa gataacagag gactaagagc gcacatatac 120 

ctcaagagcc ctaaggctgc cacaggaggg taaacaactc cacccagcac tgctccaggc 180 

cggcacaacc atcaactttt catgagcggg cccactggct gctgtctgga atgaagaatc 240 

ctatgttgct ttccagcctc acatttcccc tttgtgtact acaaaatagg agctgtttca 3 00 

ttagaaacat aaaacaatga ggaagaagct gttattgac 339 

<210> 463 
<211> 662 
<212> DNA 
<213> Homo sapiens 



<400> 463 

ngggaaannt acccnggctt tatttnanaa attgancggg gcgggccttn ccaacttacg 60 

aanatgcttc aanggaggga gccaaggaaa gtggctnttg cttggggggc gccccacaac 120 

ccttgctccc ccgatgtcca cccgtggatc cacatcccgc cagccgaaga cctcccgtgc 180 

cttggaccat tgtcgtcttg ggtcccttgt tcaaacaccc ttctttatgg aaacaccttg 240 

cttgccttgg ggctttcata agccatttcc gccctacttc ccgtggaaaa gttctaaggg 300 

gacaagggga aagaaatggg gtttgggcgg aaacgttgga acccgggggg ccccaaaggc 360 
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ccttaattgc ccttnccacc cggcccaaan gtnggccctt ggaaacattt cttnggggcc 420 

cccttngaat tttttngggg gcattncttt tcattggaaa caattttctt ggntttnatt 480 

tcaatcaatt tcccaaaatn ggtttgggtt cggttcccaa nggcccccaa nccggaattt 540 

gnaattaaan ggganngggc ctttcntttt ntaangggcc caanggaaag cntntttggc 600 

ccccggnngg aaccttgttt tttncccacc gtaaccnttt tatttttttt ccatttttcc 660 

tt 662 

<210> 464 
<211> 459 
<212> DNA 
<213> Homo sapiens 

<400> 464 

ataaaggaat actagacatc aaaangttta ttacggngan ggacatatag tcatccttcc 60 

agtttaagat ctaagagcaa tactcaaaca gaaatcaaat aaatgtctat gacaattaag 120 

gcaaacatac tcatttgtct acaagcaaag agcattttgg aaagaacact cccttgttca 180 

aattttggtg aactggttgt ggagacaaaa gtgactccat cttggatgct aatctgccat 240 

gttgacttct gattaacccc agtctgggga atgcctctaa gatttctatt tttatttatg 300 

tatactgtct gtaaaccctg ttcttaggcc aagacaccct tgatgttatc aaatcctgcc 360 

cttaggctat gacacacata acattctttc ctttttctgg anaggggggc ttcaattggc 420 

cttatacatt ccttntaaag cacatatacc ctttctctg 459 

<210> 465 
<211> 476 
<212> DNA 

<213> Homo sapiens 
<400> 465 

gctataagga tgtgtttact gcagagacaa acagtaagaa agtatacaaa attaaagaaa 60 
aatgacagtt atctttacct atcacttcaa gttatttctg tcaagaggta atgacagtta 120 
ctgaaaaaag aagttctgga cctttttcat ttgcaaactt atttttacaa atggcttctt 180 
ttcacataaa ggatttgtga tggtttaatt ttgtgtgtca acctggctgg gccatagtgc 240 
ccagatattg agtatatcat tgttctggaa gtttctatga aggtgatttt tggatgaaat 300 
tatttaaatt ggtggacttt gagtaaagca gattatcctc catgatgtgg acagacctcc 360 
cccatcantt gaaggaccgg gccaaaatga aaactgancc ctttgaggaa naaattctcc 420 
aancanatgg cctttggtct gtttctctgg agaactgnga ctaatacagg ttcttc 476 

<210> 466 
<211> 218 
<212> DNA 
<213> Homo sapiens 

<400> 466 

ggcctcttgg gggaacttcc ctgcttttaa gtccanaacc tggagantga ccaagaanca 60 

cctcanaagg ccagccaccc tcaanggagc aacccattgg ncccagactt ntcgcacgga 120 

tgccagaaaa actttnaatt ggaaggaagg cttgaaggtc aacaatgggg naaanaagtt 180 

ttttaaaaaa ataaaaaang gggagcctaa tattgtgg 218 

<210> 467 

<211> 82 

<212> DNA 

<213> Homo sapiens 

<400> 467 

cccgtgcatg gtggcttgtg cctatggacc cagctgctca agaggctgag gtgggaggac 60 
tgcttgagcc caagaagtcc aa 82 

<210> 468 
<211> 90 
<212> DNA 

<213> Homo sapiens 
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<400> 468 

cacttttggg agggccaaac aaagaangnn ttggttngac cccaggagtt tgaaaccaga 6 0 

actggacaac atagtaaacc tcatccctac 90 

<210> 469 
<211> 262 
<212> DNA 
<213> Homo sapiens 

<400> 469 

ataataagat ccttgaaagc aggcctgaac caccattgta caataaacat ttcctgcatg 60 

aataaattaa tgaaagaatg aataataaaa caagatctct tcccagagaa agtttaaagc 120 

ctctgaagac agcagacatc catttgaata accacataac aaagtgaatc atttatattg 180 

caaaagacag agaaagcatt atacttgagg gcagaggagg gagaaagcat attactcaaa 240 

taaagatgtg atactgaatt ag 262 

<210> 470 
<211> 265 
<212> DNA 
<213> Homo sapiens 

<400> 470 

cngggnttgn naaatnngcc cgtgaancnc anatnaancn cggcccacan aancaatggt 60 
aggaagcata accagagtga atcgattcct tgatcctgct ctgccaaaaa attaaagagg 120 
agcactcctg gggtttttaa cccagataag acttcagcca cagccgtatt tcccatgttc 
ctggatctct tgttctggct cttattctgc ggataaaatg tggaatagag taagcagtgc 



180 
240 



180 
240 
268 



gagttctgcc ggttcatctg gcttt 265 

<210> 471 
<211> 268 
<212> DNA 
<213> Homo sapiens 

<400> 471 

gacgtctggg gagctcctgc attaagtcag aaccngagga aggaaagctn gaaaaaaaat 60 
cgtcaaatgt tgcgggattc ttgtaagcac agagaactat gaagacctga caaggagggt 120 
atctttttct ttcatgcttg tccaacaaga gagcacattg ttagtgtgct tgaattccaa 
caaaagaagg catagaatga atcttggttg ttccctttta cttgctaaat atgtactgaa 
tgaataaatg gtgcattata catctatt 

<210> 472 
<211> 456 
<212> DNA 
<213> Homo sapiens 

<400> 472 

cctgtctggg acctgcctgc agatttcagc cacttctgga tacacctggg acagggctga 60 

tacctccact gtcttacact gtgaagagcg ggacaaaccg atgagtgaca gactactgaa 120 

tcaatcccct tttaagctgc ttaagttcca gatttagttt taaagagaaa aaaaattgtc 180 

atctttttaa aaagactgca tcttctttct cctaatagct aatatttatt gagcattcat 240 

gacacgtata cactatttta aactgccact gtgggttgat gtcactcccc cattttataa 3 00 

acatggagac tttggtaact ttctaacagt acttggccag tcagccaggc ctgtgctctt 3 60 

cagaggcgca atggggnctt tatactacca cctaaaggcn ggtnggatga ccatccctat 42 0 

aactttgttt ttaattnaag acaaacatgt aattag 456 

<210> 473 
<211> 170 
<212> DNA 
<213> Homo sapiens 

<400> 473 

atctgccgcc tcgaagagaa acattttcag aaccaaatac agaattgaca aagagaagac 60 
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ggccttggag atagagccca gctttttcat tgcgcaggtg gaaaactgag gccagatgcc 
gtgggacaga tgcagagaat gataaagtca ccaaatgacg gtgattattg 



120 
170 



<210> 474 
<211> 467 
<212> DNA 
<213> Homo 



sapiens 



<400> 474 

gtctttaacg ttttcgggga cctctggaaa acctacaggc gcggccctgg gaagctctgg 60 

gtccctagga ggggaggtga ctccgcggcg tcccgggaat gatcctcgcg gagctcgcga 120 

ggtactagcg ccccccagcg tctggattga gaaacgcacc ctgcgagggt ggagaaccag 180 

cccagcccca aagtgaggtg gcagaaaaac gaactcacgg ccaaaggact ggctgaggtt 240 

aaccagaatt gtgtaaatgt gttttgtctt gctgggctgc cccctctcct ggtcctttgg 300 

ctagggagaa caggattttg tttgggattt ttcttttgct tttttcgact gtgcctggtg 360 

gcgttcgcgg gnttgccant tttttaaggt ccaaccctgg cttgtttttg ggnnaaaaac 420 

naaaccnaaa cccccaanga attggncttt ngggtcattt ccttggg 467 



<210> 475 
<211> 440 
<212> DNA 
<213> Homo 



sapiens 



<400> 475 

cgagctgaaa tttaccataa tccggctgat gtttagactg cacccatcgt tttttccatt 60 

catctatgag taaaggagaa aaaaagaacg taaagacaaa atgcagctaa tactgaccaa 120 

gacttacagg aacggtaaag ccctgtgatg aatgtcctgt tttttcctca ttcaaaagat 180 

agagaaacag aagctcagaa tcttgcccaa aagcccagtt gtaaatggat tctcactctg 240 

ttgcccaggc tggagtgcag tggtccaatt tcagctcact gcagcctctt cagcagaatc 300 

ttgaccctct ctgagattca gttttttcat ctgtagaaat ggggacctaa ggtacagagt 360 

ttcttctggg agaattaagt gaaactgcat gcaacaccat gttaggcaca ctagaagtga 420 

tcaataaata ctacttgagt 440 



<210> 476 
<211> 438 
<212> DNA 
<213> Homo 



sapiens 



<400> 476 

gcatccattc accangcatc ctcagcccct gctatggcct ggctctctgg ggtcagcttt 60 

gttccctgcc tgccttctgc tgaggaatca gggcagtggc gggggcggcc ccaccagccc 120 

gcagtcactg gcccagacac agcgctggac acaacacccc ccgcttccca cagctgctga 180 

ttcccgagga ctgccggacg cacagctcca taacaagatt ttgggaaaca aagtcaagag 240 

tgagggtgtc attctgaaag gtgaacggtg ctcacagagg aggagcctgt gtctggggtc 300 

gtgtgcatcc tactctgctc acagtggagg catctttgga agaagtgact tattttctgg 360 

tacagagacc attccctccc ccacaccctc tcctaagact ttgtattgaa acaaagtaaa 420 

tcttacagaa attgcacc 438 

<210> 477 
<211> 193 
<212> DNA 
<213> Homo sapiens 



<400> 477 

ttataatcat catgactgca actcaaagtc cttaccaaga ccctctttga atgagaaagc 60 

tctgccatgc cttccctgtc atcatccact cttgcagcac agctggccct ctgtatctgc 120 

gggttccaca ccgatggatt caactgaccg tggatcagaa atacccagaa aaaaaattat 180 

atctctactg aac 193 

<210> 478 
<211> 345 
<212> DNA 
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<213> Homo sapiens 
<400> 478 

ggtcaagttt caggtgaaat cactagacaa gaaatatcat tcagactgcc tagggctgtg 60 

ttctgaagct acagaggtac cttgatgtca ggaagaatag caatggcaga aaatgtttca 120 

tcttgcatgc cagcacagac caatggcaat ggatgtctga atcactgggt taacaaggaa 180 

aagaatgctg tgcttaagta gcaatgtctg ctctgagcat ggcaggagaa attattggca 240 

cctctgtcag atatttgata tctatttctt aaatagaata catacatatt ctaagaacaa 300 

gaaaagcata aacaaattaa taaattactt tctgacttct aaacc 345 

<210> 479 
<211> 240 
<212> DNA 

<213> Homo sapiens 
<400> 479 

ctttgtgctg catctggcct cctgctctgt nttactctgn cgctactnca cctgcatgtn 60 
acctactgnn ggatccgntt ganaacaccn taatttnaga anacagagtt ttgaacatca 120 
ctgaccttta ccatcggtat aaccnactct ttacctccca aggctcgctc atttgtactt 180 
attttttctc attgtctctc aaatttancc aactggnatg aataaactgg aagtaaacag 240 

<210> 480 
<211> 504 
<212> DNA 
<213> Homo sapiens 

<400> 480 

aggaaaccag ntcgacagag ctgtgatttg ccctgngatt tgccctgggc cttnccacaa 60 

ttctagaaac ccatgacttg acatcattgc gcggccacct gactcccagc tggcttcagc 12 0 

ctctncgttt natctccctc tactctnact ctgctgctac caagtcagac ttnttttcan 180 

aatgccctgt atcattttaa tgactggagt gtgactttgt tctcagcaca atgagtaaca 240 

aagccaaaac actggagaat acgtttacgt attnaagaaa acctcagaca aggaagaatg 3 00 

ctttcataat acagnacatt anaatcagac gaagcctnga agggcanaat naccgatcct 3 60 

gaaaaatcan agtgtnctac agaagaagac gacagcgttt gagcacattt gttgaagcag 420 

cctcctntcc cttatggnnc gataatccca caccgnttta ccatgctctc tggccttccc 480 

agaacatcaa taaaaactgc atcc 504 

<210> 481 
<211> 274 
<212> DNA 
<213> Homo sapiens 

<400> 481 

taactggcag aacccacacc ttcaaaacag agactttggc tgcatctggc ctcctgctct 60 

gtcttcctct cacctcctcc acctccatgt cacctactga gggatcgctt gagaacacca 120 

gaatttcaga agacagagtt tgaacatcac tgacctttac catcggtata accaactctt 180 

tacctcccaa ggctcgctca tttgtactta ttttttctca tgtctctcaa atttagccaa 240 

ctggtatgaa taaactggaa gtaaacagtt ctac 274 

<210> 482 
<211> 299 
<212> DNA 
<213> Homo sapiens 

<400> 482 

gtaatcttct catctgtgag gatatggaac cccaacctct tcctggacac ctgatgatct 60 

gcttgtgatg ggctcagagt cttgaaacac agaactatga gctcatctca tatcccaatc 120 

cagcagcatg gaaacctcag actgtaaggc ccaagactgg cacttgttct ctcccaactc 180 

ttttctttct ctctctcctt tcttttatcc cttaattcct tcttgcttcc ttccaagatt 240 

tatactatta ccttttaggc aaaacatcct gaacatgtaa aataaactaa ttaaaatcg 299 

<210> 483 
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<211> 395 
<212> DNA 
<213> Homo sapiens 

<400> 483 

gaggagtctg agaagaccta aaacagaaga gaaaaaggcg aagaagatgc ttaaatatat 60 

acattattca agtaattaac tgaagccttg agcgtacaga tgatctccga aaggacgcca 120 

cagaggggag aaggctggac ttgcagaaca cattgctgtt gaagaagtga caggaagatt 180 

cagagctcac aaagaagaca ggtcagacgt ggagaggcga gccagcagaa caccctcaga 240 

aatactgctc tcctgttcgg atggccagtt ttcatatttt agaatatttt tcaaaaagca 300 

cttcaatata atgaagttcc ctcagttata acaaggccat ttttcatagc tatttgtgta 3 60 

gatagtccaa aagtgtggtg tgttatcaga aaggg 395 

<210> 484 
<211> 440 
<212> DNA 
<213> Homo sapiens 

<400> 484 

gaagaaagca ttgctctgga aagagggaag ttcattcact 
agatgccctg cggctatgga ggagggccgt ccaagctcac 
accatttcac atttagcacc agaatccagc cttggcagat 
cacagctggt ggtgaagaca gaaactcctg tgtgacaact 
gggtcaatta acatttcctg aacaacttgc aaatggaaag 
gaaaaatgag aggctcaact catctattat gacttgaacc 
aaggctgtgc tgttgcacct agacctccac ccagaaacat 
atagaaacat agagaggaaa 

<210> 485 
<211> 199 
<212> DNA 
<213> Homo sapiens 

<400> 485 

tcccgtctga actgttttgt cttggccctg tttccaccca ngaagccgca gatcctgact 60 

ccttgtgttt gtttctctgc ccagatgaga aacacccatc acctctgact ttccaaggag 120 

caaatcacgc tccgtgccgg gctcccccaa caacaccact ccctcttccc ttgcgatctc 180 

caggnctcct ttgacactt 199 

<210> 486 
<211> 426 
<212> DNA 

<213> Homo sapiens 
<400> 486 

ctcncngctt taaatcctag ntggnngnac gggctgntna cctanaggct 
cntcnnaacc acnccnagtt gcttcnagcc tccttngcgc cagcacatat 
gggccaccga tcctaagcca aagcctcccc aacctctggg ctcagaagca 
caactccagc agggaattcc agaggtgaag gtcacgggag catctttaat 
cagtagagaa gatacccaaa gagcagggag caggagccag ctccaggcta 
tattcatcaa tcattcattt atgcattaat cattcattcc ccccacccaa 
gccagngngg ccaattcagn tngnacttaa ccaggctgaa nttgntnaaa 
ccccaa 

<210> 487 
<211> 533 
<212> DNA 
<213> Homo sapiens 

<400> 487 

tttttttccc cccccccccg nggggggggn gnnnncnngg gggccccccc tcttttttgg 60 
nggttcataa agggtggana cncccnttgg gcgccctttt tgggggggtt tnaaaaaaga 120 



catccaagaa 


gagcaaaggt 


60 


agttcctaga 


agtttgtgtc 


120 


tcagggaagg 


aagccaagga 


180 


gccccctagg 


acacagttta 


240 


agccatcccc 


aatgaagact 


300 


caagtctatc 


tgtgtttgca 


360 


gttttggggc 


tgacatttta 


420 






440 



g t nn t aggnn 6 0 

ctgcancctt 120 

ggtgtaatcc 180 

cttcggttcc 240 

tacatttgtt 300 

aaaaaaaang 360 

ngggggggac 420 
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naaaatcctc ttcntggggc 
aacataacan ggggccgggg 
ttttnggtaa ttttttttna 
gtttgggncc caaanaactn 
gngggaaccc caactttggc 
tggcaagggt ggttggaaag 
ggncctttaa taaataagtn 



ccttaaaanc ccctcccctt 
gcccccccca ctttatttgt 
aagnaaccaa anaanggggg 
gggggtcctt ttggaaactt 
ccttaaagcc cttnccccaa 
tcaacccaca cccttggacc 
aaaaactggg atcatatgaa 



ggaagataag gcnngggggn 
ccccaagcct taaaattttt 
gggggttttc cacccaatgg 
cccctgggga nccctcaagg 
aaaggtggct tggggggaat 
acaaggtact aaataatttt 
aatttaatat aag 



<210> 488 
<211> 473 
<212> DNA 
<213> Homo 



sapiens 



<400> 488 
agggaattac aatatnnctt tcnggnaagt 
gcttgccttt ggaaanganc cagggcaggg 
cgtcaagnaa ggcttgggan ggaaacttga 
cttgaagctn accgggggcc aaagcttgcc 
gggggaaggg aaccacccat ttgttcggga 
aacttgggga ttgaaaccaa gaaaaaatcc 
aagcttgggg aaccgccttg ggaaccgaag 
accggggagg gnccgggccg cccgggctta 



ccgggccaga gaaaagggna cattgcctgg 
gaaaagcttc ttgggangga aaacccttgc 
aagaaagctt gttgttcttt ccgaagaaag 
aagtaagnaa tatccccttg ggatccaggg 
ggaaagaata aggggaaacc aagcctttta 
ttgcccnaaa ggggaagaag ggaaagcttg 
aagttttgcc attttaagtt cccaagattt 
nncaagtggg acccccaccg gtt 



<210> 489 
<211> 512 
<212> DNA 
<213> Homo sapiens 



<400> 489 
agcttaccct tggcntttta agnttcccct 
gantnnagta gantctccnt tgttgttntt 
tttttctttt ttcctttggc ttttaaactt 
acagngnggc tttcaanccc cgaanggnaa 
gacttttccc angggaacnt caaaagccca 
tcgaggggac accacaagcc cagggggctt 
tgggactttg caagggggct tggggacaag 
aaattgcctt tgtcaaaccc acgttggggg 
attaaaagtt tgaaggggaa acacctcctt 



aacctntatn ggnggaaccc acctttattg 
tgaaaacccc anaaantttg gnaaaacnct 
tttggccccc ccgggggttt tcccaanana 
tggnaatccn naagtttcca acaccacntt 
agaagaangg ggcccaangg gacccaagct 
cttttccttc cgaaaacccc caaggggact 
aaggttgggg ggttgggggg gggaaaaagc 
ggaagcccca ctcccatttc ccaagggtgc 
gc 



<210> 490 
<211> 518 
<212> DNA 
<213> Homo 



sapiens 



<400> 490 
ttcntgaaat tgangaaatg ggccccttcn 
tgtntgcttt ccggggccct ntggngggng 
gcccnaancc ccaacccaag ggggaacccn 
aancttgaaa gggaagtttt gggaaaacac 
aagcntcaat cagccccgca ttcaaaacaa 
cggggaagtt gggtttttca agaagacatt 
tcaaaggatt tgactcatga agggaccttg 
gcctcttgaa aatttcccac accccaagcc 
ttgggcccca gggtggccac caaataaaaa 



gggccttcgc tngnangggg gttnttttct 
gggtntttgc caanncnttt ttggaaaagg 
aaanacgttt ttccagnggc nttnggaata 
acttgggnan ggaacaaagg gcttcgggga 
gaagtggaaa cttttcttgc caaagaatgc 
ttcaagaaaa agtggaaagg ggaagaagac 
aaagggggtg ggacatccca aggaaaaggg 
gccttgttgc ttgagggact ccctccattg 
aatcctac 



<210> 491 
<211> 344 
<212> DNA 
<213> Homo 



sapiens 



<400> 491 

acccatgcag gagacctctc caggtacaca tatttcctgc tactgaatgg cttagactgg 
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gatttgcaag gaactacgaa gtccaagacc tttgcctttc ttttagaaga aggcaccagc 120 

tggttctcca atgttgaagg tcttctccag agatgaactc tgaaagccac atgttgagat 180 

ggccccatta caggatggag agcacctgaa cccccaagtt atggactaga agaagacagt 240 

tgccctggaa aatcatctga cccacattgg actttatgtg agggggaaat aaacctttat 300 

tatgttaagc tacacaataa taaataacaa caataattgt gttt 344 



<210> 492 
<211> 381 
<212> DNA 
<213> Homo 



sapiens 



<400> 492 

tctccctgtc cttttnagtn cnccaaaact ngngggaaaa nctttnaaaa atatttctcc 60 

cngggnaaaa tgnggnggaa aagtccntgg cacntgnaat gggccccctt tgtanggaaa 120 

aaannaaccc caggggttcn tgggagttcc ncgaaccgtg gggnncnttg angggcncca 180 

angggaagaa aaaaccnccg tggaaaccct taattaaagt tttngggggg tggaagaaga 240 

agaaaaataa aaaccttaaa gtattgttaa agcttcttgt catttcaaag gggtaaatac 300 

caagttgtgg gaaagggcaa gaaaaaaaat ggacccactc tccccttgga tatccattaa 360 

aaaggatgtc ccaaaatcct c 381 



<210> 493 
<211> 639 
<212> DNA 
<213> Homo 



sapiens 



<400> 493 

tctgggggag cctaccttgc tttaacttcc tnaacttaaa ggtanaacaa cnccctnttt 60 

tnccntgaaa aacnanggcn tttttngaca ttaaagncnc ttttaaggag gtatgcccaa 120 

aaaaaggnaa ncccaacccc ttnngccaaa aaatnaaacn tcaaagangg ggcnggcnaa 180 

antcngggaa ncntttnccc caggggggaa gaagaatgaa cnctttttta ntggggcttt 240 

ncagaaaaag gtggnaaggt ccacttggct ttttggcttg gnctttggga atcaaaggaa 300 

ccnagaaaaa ggaaaattan ttggataccc aatggggaag ccttggaaga atgccatttt 360 

ggtttgggga agggtttttc ttgtcttcaa acttgggtct cttgacaaag cctcttgact 420 

tggaatggta ttcccgtggc ttgggccact tatgccaagc aaggcatcat taaatttaag 480 

acggggactt ggcttgcacc tttccttgaa gaaagccaag actttccact tggatgggaa 540 

agaagcttga aaaaaccacc aaagcccagg gaagtggcaa gaaccacttg gnccttaatt 600 

tgcttncttg aagaattncc attattaata aaaagaaaa 639 

<210> 494 
<211> 342 
<212> DNA 
<213> Homo sapiens 



<400> 494 

ntagcctcag gatggaggtg gctgccagaa agaccaagta atgatcagaa gcatggaact 60 

ttcagaccta ttcctcccaa cttctggaga gggngagtgc ctggagactg agttaataat 120 

tgatcacgtc tacatgatga aacctctaag tgacaaggat cagagagctt ccaagttggt 180 

gaatacatcc atgtgcaggg agggtggcct accctaaccc catcggacag gagcacccat 240 

gttcaggaat cttctggacc tcaccttatg tattaatctc tctttatctg gctgttcatc 300 

tatattcttc atagtatcct ttataataaa caagcaaatg tc 342 



<210> 495 
<211> 613 
<212> DNA 
<213> Homo sapiens 

<400> 495 

ntcntgaaac tggagttcgg ggtngtncna ttaattgggg aaatgggann ggggaaaaat 60 

aaaaatggaa ctgggaatgg gngccgcttn ctttttttaa agntttcaaa aaatgaccat 120 

ttnccaaaaa caaagcccgg gggccttgga ncccccgggc cttggttttt aaaaaatttt 180 

aacaaacanc aagttccttg ggggaaaggg ngggggaacc cacccaacct ttttctttga 240 

aataaacttg ggggaagaat gaaaaacaag ggaaagcttc ttattgaaca ccactttgga 300 
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<210> 496 
<211> 611 
<212> DNA 
<213> Homo sapiens 



<210> 497 
<211> 436 
<212> DNA 

<213> Homo sapiens 



360 
420 



atcggaaata ttgaacaaga acacccggaa aaaatcaacg aacttcaagc ccccttccaa 
gccaccttct tgccttgttt gccccgcccg aatcacaagc ccgggaatgg caagcttgaa ^v, 
aaagaattcc cttgggggcc cttgggntcc caaacccgcc cacttgtggg actcttgaag 480 
gccctcttgc atttgtgggg tggggtcttg ccttgtggat aatttttggg tcattggggc 540 
ttgggtcttg gtccgggntt ncccatnttg gtcttggccc aaggctctat ggtnggcttn 



600 



- - ~- a = = — as^^wv.-n. yyi-uyyoLUll DUU 

aaatcccttt ggc g 13 



60 



<400> 496 

tcannaaact ggagggacgg gncacgncaa ncganncccc tgggggggct ntttaaaaac bu 

tttttcaggg agcccttatg aaacaaaacc ccggggttgn gttanggnta ctngggctng 120 

ngtcccaccc nactgggttc ttttttttct tnttggggcc ccanaaatgg aagggggatt 180 

gccccaccaa ngggaccccc tttccaacca gaacccnngg gacttattat taaaccctnt 240 
tttttgcgcc cnaccattga atgggacttt gnaacccgca aaagcttgaa ggnccattcg 



300 



— s — — — ^ ^ x_ oca yyiiu^ctLUjy jUU 

gataccgccc taacccctta cccccgggga acaatctttc attgggaaaa acaagccggg 360 
ntttttttcc gactttttac aaagccttcc cggtngggct tgggaaggcc attcttaagc 
ttggcaagaa aaacaagcaa gggaaaggat gctttccggg ggaagccctt gatgccttga 
aaaatgaaaa aaattantct taaaggctat tcaaatatca agccaagcca tttttttcca 
nggagaaang gaaaaaaggc cgaanaaaaa aacaaatttt ccaanaatgg ggttgncttc 
cttccaaccc a 



420 
480 
540 
600 
611 



<400> 497 

gaacccaaaa gaatgcccag aatgccaaga acagtgaaca gccatatgca aacgggcaat 6 0 

actgatgtta gctttaaaag taaggagttc agagtgctct gtgctgaaca tctttcggtg 120 

taattaagcc ttcatattcc tgaggaggag ctactaagac accctaccaa gtcctgggct 

crtcrr:r!t-rrrr^i<-T <-rt- 1- ^> = = ^-r= = *- r, i — * . 



180 



_ _-,__ 3 - 3 _ ;3;3 ^ 3 vw».v.uu a av J noi.i,Lai.uaa yucctgcfgct leu 

gtgcctggag gttagaaaac gaaccacata gtcctgtaat gacagaaaaa aattgaaaac 240 

tgtattttaa aaatgatttc tcaacaagac cagccggcca ctcaaccact tcagtacctc 300 

gtttctggat gaagaccctg agcaggggat ttgcactaga aaccgccttg cagaagttgt 3 60 

catcattgtt gatgggcagc aggtctccgt gcacatctgc atagccaata gttacatcac 42 0 

tgttggagat atggtg 436 

<210> 498 
<211> 445 
<212> DNA 

<213> Homo sapiens 
<400> 498 

gttctgattg atnccnaggc tnttgaagta naccccacca tttaagccag agagggagat 60 
tnaagtggan atngcngcca cctattatnc cnngatatat ttggtatacn aacnaagaaa 120 
ctnaatnatn aattngacna tnaattttaa tfggaaaaggn aaaagnaaac nccagggggc 180 
cgggtggcaa tttgntttcc nttcttagtc ccttcaaaaa agtagaaaat agtgganatg 240 
aagcagggtt gatatgaatt tggcttgctt ccccccccaa tcttaccttt gcttgnaggt 300 
nccataatcc ccacatgtgg ggggaggaag cctttaggag gtgatttaat catggggtgg 360 
gtacccgcat gctgtctcat gataatgagt gagttctcca agaattaacg cttttatagg 420 
aacctttttc cccttttact tggcc 445 

<210> 499 
<211> 295 
<212> DNA 
<213> Homo sapiens 

<400> 499 

gttcttccca ttctggagta anaggatgtt gcnttnnaag ggtngtggga agggnncnan 60 
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aancttnccn ggantaangg cctaagggng gctttngacc aagggaccct ccaagtcaag 120 

gttcctttta catcacatat tgggaccccc aacagctggg cttcttcaag gtgagacaag 180 

acctgtggtt tgaatccacc atttaatggc tgngtgatca tgtgcaactt actcaacctc 240 

tcagagcctc aagtttcctc attaataaag tggagataat aatagaacac acctt 295 

<210> 500 
<211> 181 
<212> DNA 
<213> Homo sapiens 

<400> 500 

ggtttcctgg agttnggatt ttgctgactg cacactcacg gtgctatcca acatgancat 60 

cttccctgca gtttctacaa tttggcagtt ggatccacct gaatcctttg gcaaggccaa 120 

acgtggtgtc tnangaagaa cacattgaag tctctgtttt ttaaatatca ttatgacctt 180 

g 181 

<210> 501 
<211> 469 
<212> DNA 
<213> Homo sapiens 

<400> 501 

cagaaactga gatgaaagct ggggttggag atggagtttg tcattttntg ancttaaann 60 

naccngcntn ataacaaaag ccagcncacc ccanacngga gaatggaaag ggaggaaaaa 120 

tttgggtccc gtcttttaca agggntgntg agttacttca ccaatcctgg aatgctgatc 180 

ttttgggaac ttgttaaaca gtctttccac cccctttgtt cgaagctttt ggtgaagtgt 240 

ttcanaaact gacgaaatgc aggatcgttt tccttacaca cacaaatgcc atggcaacag 300 

caacttcgtg acaacagcaa agaaagccag actgggaatt tgccaaccca gagtggtgac 360 

catctgtgag ggcccaaacc cttcaaatgt tgccccgttc taaagtgctt atcttaaccc 420 

angcttttgt acatagcaaa agcgacattt aaagtgacat aagaatggg 469 

<210> 502 
<211> 400 
<212> DNA 
<213> Homo sapiens 

<400> 502 

tttttttcca attggggggg gaccaaattt tgnggggtna aattcccaaa tangggtggc 60 

cntttttttg ccttgggaac gacccatttg ggggggaaan ttaaaacccc ccccttnttt 120 

ggcnncnttg tntgnaaaag naaattggcc ccccggggcc ctttttttnc ccctttgggc 180 

caaaggggaa ttttttaaac cctttaaaaa attgggtntt ggccttgggg gaacctttgg 240 

cccaagaatg ggccccaaaa agngggnacc cccaataact nttanccccc tntttggcct 300 

tggttcaagc ncccaaaagg naaaanaaga ccctggrigtc nntttggggg aggtggggng 360 

gaaacccaaa atcccatttn gggggntttt ttttaaacct 400 

<210> 503 
<211> 185 
<212> DNA 
<213> Homo sapiens 

<400> 503 

ttgggggggg tttcccccaa acaaaaattt tcccgccttt tctttcagtt ggannggtgg 60 

ggagccccna atggaactta aaaatttctt gttggggggt tggggaggaa gaataaaaaa 120 

tgcccccttt nttnggggcc cttggacccc ttattttggc cccttgccca ttgcttgggc 180 

ccttg 185 
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